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STATISTICAL  METHODS  FOR  ANALYSIS  OF  NF  CLINICAL  DATA 


INTRODUCTION 

This  project  describes  research  in  statistical  methods  that  has  been  useful  for  statistical 
modeling  and  analysis  of  clinical  data  from  NF1  and  NF2  subjects.  The  statistical 
methods  are  classified  into  the  areas: 

(a)  estimation  of  familial  correlation  for  different  types  of  data, 

(b)  assessment  of  multi-hit  mutation  models  for  incidence  of  tumours. 

Accomplishing  the  project  required  development  of  some  new  statistical  theory,  as  well 
as  non-trivial  computer  programming  to  implement  the  new  and  existing  theory. 

One  new  statistical  inference  method  for  familial  data  is  composite  likelihood.  This 
method  is  intended  in  general  for  situations  in  which  a  multivariate  probability  is  too  time 
consuming  to  compute  because  of  numerical  integrations;  for  the  NF  data,  this  situation 
occurred  when  the  family  size  exceeded  4  to  6  (the  lower  bound  on  family 
size  depends  on  the  actual  multivariate  probability). 

Clinical  data  exist  in  many  formats  including  binary,  categorical,  count,  and  continuous 
information.  Also  common  are  censored  data  (e.g.,  only  a  lower  bound  is  known  for 
some  measurements).  One  goal  of  the  project  was  the  creation  of  a  software  package 
for  familial  data  analysis  for  different  types  of  data,  such  as  binary,  count,  and 
right-censored  survival  data.  The  modules  based  on  composite  likelihood  were  the 
most  recent  additions  to  the  package. 
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BODY 


Purpose  of  the  project: 

(A)  To  develop  statistical  methods  that  can  be  used  to  characterize  the  phenotype  of 
individuals  with  NF1  and  NF2. 

(B)  To  develop  methods  to  elaborate  on  the  standard  two-hit  model  of  tumour  formation 
taking  into  account  additional  pathogenic  factors  and  allelic  differences  for  tumours  in 
NF1  and  NF2. 

Research  Accomplishments: 

The  research  accomplishments  associated  with  each  objective  from  the  statement  of 
work  are  summarized  below. 

Objective  1.  Develop  statistical  methods  for  interval-censored  data,  and  obtain 
estimates  of  age  of  onset  distributions  for  N FI  and  NF2  features,  using  longitudinal 
information  in  the  databases. 

This  objective  is  being  postponed  as  we  currently  do  not  have  enough  longitudinal 
information  in  the  databases. 


Objective  2.  Develop  statistical  methods  for  familial  correlations  for  non-continuous 
and  censored  data,  and  obtain  estimates  of  intraclass  and  interclass  correlations  for 
quantitative  and  binary  traits  in  NF1  and  NF2. 

The  aims,  as  outlined  in  the  initial  grant  proposal,  have  now  been  mostly  completed  with 
the  completion  of  Yinshan  Zhao's  thesis  ( ftp.stat.ubc.ca:pub/hjoe/zhao/zhao04 _phd.pdf) 
plus  two  submitted  papers  to  statistics  journals.  A  third,  more  applied  paper,  will  be 
completed  soon. 

Estimation  of  familial  correlations  in  clinical  traits  is  related  to  the  assessment  of  familial 
aggregation  in  genetic  diseases.  It  is  important  to  our  understanding  of  the  causes  of 
variable  expressivity  in  Mendelian  diseases. 

We  give  a  short  overview  of  Zhao's  thesis.  This  work  involves  the  theory  for  various 
estimating  equation  approaches  that  should  lead  to  more  reasonable  computing  time  for 
estimating  familial  associations  for  traits  in  the  form  of  right-censored  survival,  binary  or 
count  data.  An  example  of  a  count  variable  is  the  number  of  tumors  of  a  particular  type; 
an  example  of  a  right-censored  survival  variable  is  the  age  of  onset 
of  a  particular  disease  feature  -  those  who  do  not  have  the  feature  are  right-censored  at 
the  age  of  last  clinical  observation. 
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For  most  of  the  multivariate  models  for  familial  data,  such  as  those  with  a  latent 
multivariate  normal  distribution,  the  maximum  likelihood  approach  is  not  computationally 
tractable  when  high-dimensional  numerical  integration  is  involved  It  was  necessary  to 
develop  other  estimating  approaches  which  are  computationally  less  demanding,  and 
relatively  efficient  (in  the  sense  of  variance  of  the  sampling  distribution  of  the  estimator). 
Estimation  using  two  types  of  composite  likelihood  equations 
were  considered;  these  are  based  on  the  likelihoods  of  the  univariate  and  bivariate 
margins  of  the  multivariate  model.  These  methods  were  shown  to  perform  well  for  a 
number  of  multivariate  models  useful  for  familial  data. 

Other  work  done  by  Lisa  Kuramoto,  a  research  assistant,  is  summarized  next. 

Genotype-phenotype  analyses,  accounting  for  the  familial  associations,  were  done  for 
some  count  variables  in  the  NF2  database;  count  variables  included  the  number  of 
spinal  tumors,  number  of  meningiomas,  and  number  of  cutaneous  schwannomas  for 
NF2  patients  (please  see  Baser  ME,  Kuramoto  L,  Joe  H,  et  al,  2004).  Further  analysis 
of  several  NF2  phenotypes  involved  making  subgroups  for  the  genotype  of  a  splice-site 
mutation;  the  splice  site  was  split  into  one  of  exon  1-5,  exon  6-10  and  exon  11-15.  The 
analyses  suggest  severity  of  disease  depended  on  splice  site  location.  Some 
conclusions  are: 

(a)  subjects  in  the  exon  1  to  5  category  seem  to  have  a  higher  prevalence  and  number 
of  meningiomas  than  subjects  in  the  other  splice  site  mutation  categories. 

(b)  subjects  in  the  exon  1 1  to  15  category  seem  to  have  a  higher  age  at  onset  than  the 
other  splice  site  mutation  categories. 

Currently,  a  manuscript  is  being  written  with  the  analyses. 


Research  by  graduate  student,  Tracy  Tucker,  concludes  work  related  to  objective  2  with 
a  clinical  application  to  an  NF1  problem:  Some  NF1  patients  appear  to  have  a  greater 
risk  than  others  for  developing  malignant  peripheral  nerve  sheath  tumors  (MPNST). 
Tucker  tested  the  hypothesis  that  MPNSTs  are  associated  with  the  presence  of  both 
subcutaneous  neurofibromas  and  internal  neurofibromas  in  NF1  patients,  but  are  not 
associated  with  cutaneous  neurofibromas.  Logistic  regressive  modeling  and  survival 
analysis  curves  of  censored  longitudinal  quantitative  and  semi-quantitative  data 
appeared  to  support  the  hypothesis  and  indicated  the  need  for  a  longitudinal  study  of 
MPNST.  (Manuscript  in  preparation). 
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Objective  3.  Fit  multi-hit  mutation  models  for  the  incidence  of  NF2  and  NF1  tumours  by 
age,  distinguish  whether  a  two-hit  or  three-hit  model  provides  a  better  fit  to  the  data,  and 
adapt  the  models  to  account  for  mutation  type  and  other  factors. 

Two-  and  three-hit  models  of  vestibular  schwannomas  were  fit  to  data  for  NF2  subjects; 
this  was  published  in  Genetic  Epidemiology  [Woods  et  al  2003].  With  the  latest  NF2 
database  with  more  data  on  mutation  type,  we  considered  the  strength  of  the 
genotype-phenotype  correlations  when  fitting  the  two-hit  and  three-hit  models  for 
vestibular  schwanomas  separately  for  several  mutation  types. 

The  analysis  suggests  that  the  age  of  onset  of  the  first  vestibular  schwannoma  is 
smaller  for  truncating  mutations  (nonsense  and  frameshift)  than  the  other  classes  such 
as  splice  site,  missense,  large  deletions. 

For  the  fitted  2-hit  model,  the  comparison  of  truncating  mutations  versus  others 
suggested  a  common  mutation  rate  but  higher  cell  growth  and  death  rates  for  the 
truncating  mutation  class.  This  interpretation  depends  on  having  the  right  order  of 
magnitude  for  the  number  of  Schwann  cell  precursors.  The  initial  check  has  been  done 
and  final  verification  is  ongoing  now.  A  manuscript  has  been  prepared  for  submission 
pending  these  final  results. 

Objective  4.  Write  C  code  to  implement  all  of  these  statistical  methods  and  provide  a 
user-friendly  interface  for  the  code. 

Software  written  in  C  and  perl,  is  being  developed  in  Unix/Linux;  it  runs  also  in  Windows 
with  Cygnus/Gnuwin  [see  www.cygwin.com],  the  public  domain  version  of  Unix  for 
Windows.  The  implementation  of  the  interface  currently  is  through  control  files  which 
specifies  parameters  and  data  files. 

The  modules  included  in  the  package  are: 

a)  for  binary  response  data,  maximum  likelihood  estimation  for  the  multivariate  probit 
and  logit  models,  and  composite  likelihood  estimation  methods  for  the  multivariate 
probit  model 

b)  for  count  data,  maximum  likelihood  estimation  for  exchangeable  correlation  form  and 
composite  likelihood  estimation  for  the  general  dependence  form  of  the  multivariate 
Poisson-lognormal  model,  and  maximum  likelihood  estimation  for  the  gamma  mixture  of 
negative  binomial  model 

c)  for  continuous  data,  maximum  likelihood  estimation  for  the  multivariate  normal  model 

d)  for  right-censored  continuous  data,  maximum  likelihood  estimation  and  composite 
likelihood  estimation  for  the  multivariate  normal  model 

The  latest  modules  are  based  the  estimation  methods  from  Zhao's  PhD  thesis  research. 
In  addition,  there  is  code  that  can  be  used  to  generate  codes  for  different  relation 
classes  (e.g,  sib-sib,  mother-offspring,  father-offspring,  parent-offsprint,  degree  2, 
degree  3  etc).  The  estimation  modules  essentially  then  estimate  a  different  correlation 
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parameter  for  different  relation  classes.  By  comparing  the  correlation  estimates  for  the 
different  relation  classes,  one  can  get  some  ideas  of  the  genetic  factors  influencing  the 
strength  of  familial  association.  The  package  has  been  designed  in  such  a  way  that 
modules  can  be  easily  added  for  different  models.  For  example,  we  are  considering  the 
multivariate  normal  copula  model  with  negative  binomial  margins  for  familial  count  data, 
and  the  multivariate  normal  copula  model  with  Weibull  margins  for  familial  onset-time 
censored  data. 

The  final  details  were  completed  by  programmer  Hongbin  Zhang  who  integrated  and 
modularized  the  code  for  different  multivariate  models  that  were  written  by  Lisa 
Kuramoto,  Yinshan  Zhao  and  me.  Specific  modules  of  the  package  were  used  in  the 
several  of  the  previous  publications  that  came  out  of  this  grant. 

The  latest  version  of  the  software  package  will  be  put  in  the  directory 
ftp://ftp.stat.ubc.ca/pub/hjoe/famil/ 

The  software  package  will  be  referred  to  in  updates  of  the  recently  submitted  papers  for 
this  project. 


The  summary  according  to  the  proposed  timeline  of  work  is  given  below. 
0-12  months : 

-  development  of  statistical  theory  for  the  simpler  cases, 

-  coding  into  C  programs  and  use  on  current  NF1/NF2  databases 

12-24  months  : 

-  extension  of  theory  to  cover  more  general  situations 

-  continuation  of  coding  and  data  analysis 

-  presentation  of  preliminary  results  in  technical  papers  and  conferences 
24-36  months  : 

-  writing  more  general  publications, 

-  conversion  of  C  code  to  a  form  with  friendlier  input  instructions,  so  that  a 
non-computer  programmer  can  use  the  computer  programs. 


The  theory  has  been  developed  for  objectives  2  and  3.  The  coding  into  C  programs  and 
use  on  current  NF1/NF2  databases  has  been  completed  for  the  statistical  methods,  and 
implemented  in  an  organized  way  into  the  software  package.  Ten  manuscripts  have 
been  published  or  accepted  for  publication,  three  others  have  been  submitted,  and  four 
more  are  in  draft.  A  total  of  fourteen  presentations  were  made,  or  will  be  made,  at  the 
2000-2004  meetings  of  the  American  Society  of  Human  Genetics,  the  2003  and  2004 
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NNFF  International  Consortium  for  the  Molecular  Biology  of  NFI  and  NF2,  in  Aspen, 
Colorado,  and  the  2003  European  Neurofibromatosis  meeting,  in  Turku,  Finland. 

We  fell  one  year  behind  the  original  schedule  because  of  the  theory  in  the  PhD 
research;  in  advance  it  is  hard  to  predict  how  quickly  PhD  students  can  accomplish 
things,  thus  the  2003-2004  year  was  a  no-cost  extension. 
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KEY  RESEARCH  ACCOMPLISHMENTS 

"  Comparison  of  two-  and  three-hit  models  for  onset  time  of  vestibular 
schwannomas  for  NF2  subjects. 

"  Development  of  new  statistical  theory  and  new  multivariate  models  for  analysis  of 
NF1  and  NF2  phenotype  data  that  are  discrete  or  censored. 

"  Completion  of  a  software  package  for  analysis  of  familial  data  of  various  types 
(binary,  count,  continuous,  censored).  The  software  written  in  the  C 
programming  languages,  developed  in  Unix/Linux,  runs  also  in  Windows  with 
Cygnus/Gnuwin  (public  domain  version  of  Unix  for  Windows). 
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2003,  Los  Angeles). 

Tucker  T,  Wolkenstein  P,  Friedman  JM. 

Association  Between  the  Occurrence  of  Malignant  Peripheral  Nerve  Sheath  Tumours 
and  Subcutaneous  and  Internal  Neurofibromas  in  NF1 

(Presentation  to  10th  European  Neurofibromatosis  Meeting,  Turku,  Finland,  August 
2003) 

Zhao  Y. 

Composite  Likelihood  Estimation  in  Familial  Data  Analysis 

(Presentation  to  Biometrics  (WNAR)  meeting  in  Albuquerque,  New  Mexico:  June 

26-June  30,  2004). 
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Abstracts  and  Presentations  Prior  to  Aug  1 , 2003 

(Full  details  in  the  August  2001 , 2002  and  2003  Annual  Reports) 

Baser  ME,  Woods  R,  Joe  H,  Kuramoto  L,  Friedman  JM,  Wallace  AJ,  Bijlsma  E, 
Olschwang  S,  Papi  L,  Parry  DM,  Ramsden  RT,  Rouleau  GA,  Evans  DGR.  The  location 
of  constitutional  neurofibromatosis  2  ( NF2 )  splice-site  mutations  is  associated  with  the 
number  of  intracranial  meningiomas:  results  from  an  international  NF2  database. 
(Presented  at  the  NNFF  International  Consortium  for  the  Molecular  Biology  of  NF1  and 
NF2, 1-4  June  2003,  Aspen  CO). 

Baser  ME,  Joe  H,  Kuramoto  L,  Friedman  JM,  Wallace  AJ,  Ramsden  RT,  Evans  DGR. 
Genotype-phenotype  correlations  for  cataracts  in  neurofibromatosis  2. 

(Presented  at  the  NNFF  International  Consortium  for  the  Molecular  Biology  of  NF1  and 
NF2, 1-4  June  2003,  Aspen  CO). 

Baser  ME,  Parry  DM,  Joe  H,  Kuramoto  L,  Friedman  JM,  Gillespie  JE,  Wallace  AJ, 
Ramsden  RT,  Evans  DGR.  Genotype-phenotype  correlations  for  spinal  tumors  in 
neurofibromatosis  2. 

(Presented  at  the  NNFF  International  Consortium  for  the  Molecular  Biology  of  NF1  and 
NF2,  1-4  June  2003,  Aspen  CO). 

Baser  ME,  Joe  H,  Kuramoto  L,  Friedman  JM,  Gillespie  JE,  Wallace  AJ,  Ramsden  RT, 
Evans  DGR. 

Genotype-phenotype  correlations  for  peripheral  nerve  tumors  in  neurofibromatosis  2. 
(Presented  at  the  NNFF  International  Consortium  for  the  Molecular  Biology  of  NF1  and 
NF2, 1-4  June  2003,  Aspen  CO). 

Baser  ME,  Friedman  JM,  Joe  H,  Wallace  AJ,  Ramsden  RT,  Evans  DGR. 
Genotype-phenotype  correlations  for  presenile  cataracts  in  neurofibromatosis  2. 

Am  J  Hum  Genet 71  (Suppl):428,  2002. 

(Presented  at  the  American  Society  of  Human  Genetics  52nd  Annual  Meeting,  2002 
October  15-19,  Baltimore). 

Palmer  C,  SzudekC,  Joe  H,  Riccardi  VM,  Friedman  JM. 

The  development  of  cutaneous  neurofibromas  is  influenced  by  familial  and  local  factors 
in  patients  with  Neurofibromatosis  1  (NF1). 

Am  J  Hum  Genet  67  (Suppl):677,  2000. 

(Presented  at  the  American  Society  of  Human  Genetics  50th  Meeting,  Philadelphia, 
October,  2000) 
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Szudek  J,Joe  H,  J.M.  Friedman  JM. 

Familial  aggregation  of  neurofibromatosis  1  (NF1)  clinical  features. 

Am  J  Hum  Genet 67  (Suppl):1 137,  2000. 

(Presented  at  the  American  Society  of  Human  Genetics  50th  Meeting,  Philadelphia, 
October,  2000) 

Tzenova  J,  Joe  H,  Riccardi  VM,  Friedman  JM.  The  effect  of  parental  age  on  the 
occurrence  of  neurofibromatosis  1 . 

Am  J  Hum  Genet  69  (Suppl):393,  2001 . 

(Presented  at  the  American  Society  of  Human  Genetics  51st  Annual  Meeting,  San 
Diego,  October  2001). 

Zhao  Y,  Kumar  RA,  Baser  ME,  Evans  DGR,  Wallace  A,  Rouleau  GA,  Mautner  VF, 
Kluwe  L,  Joe  H,  Friedman  JM.  Allele  class-independent  intrafamilial  correlation  of  age 
at  onset,  age  at  hearing  loss  and  number  of  intracranial  meningiomas  in 
neurofibromatosis  2  (NF2). 

Am  J  Hum  Genet 69(Suppl):420,  2001. 

(Presented  at  the  American  Society  of  Human  Genetics  51st  Annual  Meeting,  San 
Diego,  October  2001). 
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Degrees  Supported  in  Whole  or  Part  by  this  Award. 

Zhao,  Yinshan  (2004).  Statistical  modelling  and  inference  for  discrete  and  censored 
familial  data.  PhD  thesis,  Department  of  Statistics,  University  of  British  Columbia. 
[Defense  in  February  2004,  submitted  to  University  in  March  2004,  available  at 
ftp.stat.ubc.ca:pub/hjoe/zhao/zhao04_phd.pdf] 

Lee,  Bernard  (2003).  Phylogenetic  Footprinting  of  the  NF1  5  Upstream  Region  (5UR). 
MSc  thesis,  Department  of  Medical  Genetics,  University  of  British  Columbia, 

(Please  see  details  in  2003  annual  report) 

Aeschliman,  Dana  (2001).  Survival  Times  of  NF2  Patients.  MSc  thesis,  Department  of 
Statistics,  University  of  British  Columbia 
(Please  see  details  in  2002  annual  report.) 

Szudek,  Jacek.  (2001).  Analysis  of  variable  expressivity  in  neurofibromatosis  1.  Ph.D. 
thesis,  Department  of  Medical  Genetics,  University  of  British  Columbia, 

(Please  see  details  in  2001  annual  report) 

Latif,  A.  H.  Md.  M.  (2001).  A  comparison  of  methods  for  multivariate  binary  response 
with  simulation  studies.  MSc  thesis,  Department  of  Statistics,  University  of  British 
Columbia, 

(Please  see  details  in  2001  annual  report) 

Woods,  Ryan.  (2000).  Models  for  the  development  of  tumours  in  neurofibromatosis  2. 
MSc  thesis,  Department  of  Statistics,  University  of  British  Columbia,  August  2000. 
(Please  see  details  in  2001  annual  report) 

Thesis  in  Progress: 

Tucker,  Tracy  (PhD  candidate).  Neurofibroma  formation  mechanisms  in  NF1. 
Department  of  Medical  Genetics,  University  of  British  Columbia. 

(Please  see  this  report  for  details  of  some  of  her  work). 
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CONCLUSIONS 


The  goals  of  this  project  were  to  develop  statistical  methods  that  can  be  used  to 
characterize  the  complex  phenotypes  of  NF1  and  NF2  and  to  model  the  mechanisms  of 
tumor  formation. 

We  have  successfully  developed  new  methods  to  handle  various  data  types,  including 
phenotypic  data  (of  various  formats  such  as  binary  or  censored),  genetic  data,  and  data 
for  individuals  and  families.  The  methods  have  been  rigorously  tested  mathematically 
and  software  has  been  developed  to  make  the  methods  generally  available  to  other 
researchers. 

Beyond  the  development  of  new  statistical  techniques,  is  their  clinical  application. 
Working  in  conjunction  with  genetic  epidemiologists  and  clinicians  in  British  Columbia, 
California,  and  the  UK,  we  have  had  access  to  clinical  data  for  both  modeling  and 
testing.  For  example,  by  incorporating  techniques  for  handling  survival  data  into  a 
random  effects  model  and  using  a  mixture  model  for  count  data,  we  have  been  able 
to  estimate  intra-familial  correlations  in  continuous  variables  such  as  age  at  onset  of 
hearing  loss  (right  censored),  and  number  of  intra-cranial  meningiomas  in  NF2  (Zhao  et 
al,  2002).  Refining  these  calculations  by  taking  into  account  the  nature  of  the  mutation, 
these  statistical  methods  allow  clinicians  to  make  individual  and  familial  predictions 
which  are  useful  in  genetic  counselling.  Further,  the  findings  of  such  clinical  studies 
may  ultimately  lead  to  an  understanding  of  the  mechanism  whereby  genetic  mutations 
cause  the  various  symptoms  of  NF1  or  NF2 

Genotype-phenotype  correlations  are  also  useful  in  genetic  counselling  or  in  identifying 
particularly  high  or  low  risk  individuals  for  screening  decisions.  We  have  demonstrated 
such  techniques  in  NF1  and  NF2.  For  example,  in  Baser,  Kuramato,  Joe  et  al,  2003, 
we  use  a  multivariate  probit  model  to  examine  the  relative  risk  of  cataract  formation. 
Similar  methods  have  demonstrated  that  intracranial  meningiomas,  spinal  tumors,  and 
peripheral  nerve  sheath  tumours  are  also  influenced  by  the  type  of  NF2  mutation. 

We  have  also  contributed  to  the  understanding  of  the  pathogenetics  of  tumour  formation 
by  modeling  tumor  development  in  vestibular  schwannomas  (Woods  et  al  2003):  A 
traditional  model  for  tumor  formation,  the  two-hit  model,  suggest  that  two  mutations 
are  necessary  before  a  tumor  forms.  In  genetic  conditions  such  as  NF1  orNF2,  where 
affected  people  are  born  with  one  mutation  in  a  tumor  suppressor  gene,  the  two-hit 
model  suggests  that  one  additional  mutation  may  be  sufficient  for  tumor  formation.  We 
developed  various  models  to  test  this  hypothesis  in  vestibular  schwannoma 
development  in  people  with  NF2.  By  fitting  models  to  NF2  patient  data,  we  were  able  to 
show  that  a  three-hit  model  may  well  provide  a  better  solution.  These  techniques  can  in 
future  be  applied  to  other  tumours  such  as  meningiomas. 
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Our  work  with  NF1  (for  example,  Szudek  et  al,  2002)  has  enabled  us  to  make 
predictions  regarding  the  genetic  cause  of  many  of  the  features  of  this  complex 
disorder.  While  controlling  for  gender  and  age  we  were  able  to  look  for  patterns  of 
associations.  We  examined  familial  aggregation  for  a  number  of  symptoms  in  different 
classes  of  relatives  (e.g.,  parent-child,  siblings,  second  degree  relatives).  Our  analyses 
indicate  that  some  of  the  symptoms  appear  to  be  influenced  by  functional 
polymorphisms  of  the  normal  NF1  allele  whereas  others  are  influenced  by  modifying 
genes,  and  others  by  the  abnormal  NF1  gene  itself.  More  than  one  genetic  mechanism 
may  well  be  influencing  the  phenotype  simultaneously,  and  studies  with  larger  families, 
with  more  uniformly  collected  data  and  in  more  diverse  populations  would  help  to  clarify 
these  mechanisms.  Molecular  genetic  techniques,  using  SNPs  or  other  methods,  could 
be  used  in  combination  with  these  statistical  methods  to  facilitate  the  identification  of 
such  putative,  modifying  loci. 

The  methods  we  have  developed  and  refined  are  useful  for  other  genetic  conditions, 
particularly  where  there  is  a  complex  phenotype  or  no  obvious  genotype-phenotype 
correlation.  We  have  contributed  openly  available,  well-documented  code  which  will  be 
useful  to  other  statisticians  and  genetic  epidemiologists  who  are  interested  in  learning 
about  the  pathogenesis  of  inherited  genetic  disorders.  Further,  we  have  contributed  a 
body  of  new  publications  characterizing  the  phenotypes  of  both  NF1  and  NF2. 
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APPENDIX  1 


Abstracts  accepted  or  presented  after  August  1 , 2003  (Please  refer  to  the  annual 
reports  for  full  text  of  the  previous  abstracts) 


Longitudinal  vestibular  schwannoma  growth  rates  in  people  with  neurofibromatosis  2 
(NF2):  a  combined  analysis  of  NF2  patients  from  three  countries.  M.E.  Baser,1  T. 
Patankar,2  L.  Kluwe,3  V.-F.  Mautner,4E.  Makariou,5D.M.  Parry,6  A.J.  Wallace,7  R.T. 
Ramsden,8D.G.R.  Evans,7  A.  Jackson2 

^os  Angeles,  CA,  U.S.A.,  2Division  of  Imaging  Science  and  Biomedical  Engineering, 
University  of  Manchester,  Manchester,  U.K.,  department  of  Neurosurgery,  University 
Hospital  Eppendorf,  Hamburg,  Germany,  department  of  Neurology,  Klinikum  Nord 
Ochsenzoll,  Hamburg,  Germany,  department  of  Radiology,  Georgetown  University 
Medical  Center,  Washington,  D.C.,  U.S.A.,  6Genetic  Epidemiology  Branch,  National 
Cancer  Institute,  Bethesda,  MD,  U.S.A.,  7University  Department  of  Medical  Genetics,  St. 
Marys  Hospital,  Manchester,  U.K.,  department  of  Otolaryngology,  Manchester  Royal 
Infirmary,  Manchester,  U.K. 

(Accepted  for  presentation:  American  Society  of  Human  Genetics,  54th  Annual  Meeting, 
Toronto,  October,  2004) 

In  two  previous  longitudinal  studies  of  vestibular  schwannoma  (VS)  growth  rates  in 
people  with  neurofibromatosis  2  (NF2),  we  found  that  growth  rates  tended  to  decrease 
with  increasing  age  and  that  genotype-phenotype  correlations  were  not  apparent.  To 
more  definitively  address  these  questions,  we  assessed  VS  growth  rates  in  84  NF2 
patients  from  the  United  States,  Germany,  and  the  United  Kingdom.  The  median  follow¬ 
up  was  3.9  years  (range,  0.2  to  14.8  years).  Box  models  were  used  to  estimate  VS 
volumes  and  linear  regression  was  used  to  evaluate  the  association  between  tumor 
doubling  time  (TDT,  in  years)  and  covariates.  VS  growth  rates  decreased  with 
increasing  age  at  baseline  (log10  TDT  =  .834  +  .008  x  age  at  baseline,  SEb  =  .004); 
people  aged  <15  years  at  baseine  had  the  highest  growth  rates.  VS  growth  rates  were 
not  significantly  different  between  people  with  constitutional  NF2  splice-site  mutations 
(N  =  19)  and  those  with  nonsense  or  frameshift  mutations  (N  =  26);  there  were  too  few 
other  types  of  mutations  to  analyze  separately.  To  estimate  the  effect  of  volumetric 
accuracy  on  growth  rates,  we  used  two  methods  (one-  and  two-component  box  models) 
to  estimate  volumes  and  TDT  for  26  VSs  that  had  intracanalicular  and  extracanalicular 
portions,  which  are  measured  separately  in  the  two-component  model.  One-component 
baseline  VS  volumes  were  greater  than  two-component  volumes  by  a  median  of  19%, 
and  one-component  TDTs  were  less  than  two-component  TDTs  by  a  median  of  12%. 
These  results  suggest  that  different  three-dimensional  VS  volumetric  methods  have  a 
relatively  small  average  effect  on  VS  growth  rates,  possibly  because  the  magnitude  of 
long-term  longitudinal  VS  volume  change  is  often  far  greater  than  the  magnitude  of  VS 
volume  measurement  error. 
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The  epidemiology  of  neurofibromatosis  2  (NF2).  M.E.  Baser,1  J.M.  Friedman,2  H.  Joe,3 
D.G.R.  Evans4 

^os  Angeles,  CA,  U.S.A.,  department  of  Medical  Genetics  and  department  of 
Statistics,  University  of  British  Columbia,  Vancouver,  B.C.,  Canada,  "University 
Department  of  Medical  Genetics,  St.  Marys  Hospital,  Manchester,  U.K. 

(Presentation  at  the  NNFF  International  Consortium  For  The  Molecular  Biology  of  NF1, 
NF2  and  SchwannomanotsisNF,  Aspen,  Colorado,  May  2004) 

Research  on  the  epidemiology  of  NF2  has  been  limited  by  the  rarity  of  the  disease. 
Recently,  there  have  been  longitudinal  studies  of  vestibular  schwannoma  (VS)  growth 
rates  in  NF2  and  the  risk  of  mortality  in  NF2.  Additional  longitudinal  studies  are  in 
progress  on  the  development  of  symptoms  and  their  relationship  to  the  risk  of  mortality, 
and  modeling  patterns  of  VS  growth  rates.  Other  studies  are  based  on  two  large  NF2 
databases.  The  United  Kingdom  NF2  registry  is  being  used  for  several 
studies:  genotype-phenotype  correlations  for  each  common  type  of  NF2-associated 
nervous  system  tumor,  empirical  evaluation  and  development  of  diagnostic  criteria  for 
NF2,  and  two-  and  three-hit  mathematical  models  of  VS  formation  that  include 
genotype-phenotype  correlations.  The  international  NF2  database  has  all  published 
constitutional  NF2  mutations,  data  from  the  United  Kingdom  NF2  registry,  unpublished 
data  that  are  contributed  by  investigators  around  the  world,  and  summary  clinical 
information  (as  well  as  all  published  somatic  NF2  mutations  in  non-heritable  tumors). 
The  international  NF2  database  is  being  used  to  study  genotype-phenotype  correlations 
for  the  location  of  constitutional  NF2  spiice-site  mutations.  A  description  of  the 
international  NF2  database  and  its  comprehensive  list  of  constitutional  NF2  mutations  is 
being  distributed  to  on-line  human  gene  mutation  databases. 
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Lee  Bernard  TK,  Friedman  JM. 

Phylogenetic  Footprinting  of  the  NF1  5  Upstream  Region  (5UR). 

(Presented,  53rd  Annual  Meeting,  American  Society  of  Human  Genetics,  November 
2003,  Los  Angeles). 

Medical  Genetics,  University  of  British  Columbia,  Vancouver,  BC,  Canada. 

The  5UR  of  the  human  neurofibromatosis  1  gene  was  defined  as  the  59756  bp  region 
between  the  NF1  translation  start  site  and  the  end  of  the  first  upstream  GenScan 
prediction  (NT_01 0799.1 14).  The  5URs  of  mouse  and  rat  were  defined  as  59756  bp 
upstream  of  the  translation  start  site  of  the  NF1  homologs  in  these  species.  The  5UR  in 
Fugu  was  defined  as  the  1488  bp  segment  between  the  known  5  flanking  gene  (FN5) 
and  the  NF1  translation  start  site.  Sequence  alignments  were  established  by  mVista, 
and  windows  of  identity  greater  than  that  of  the  coding  regions  and  extending  50  bp  or 
more  in  length  among  all  3  mammalian  species  were  identified  with  Frameslider,  a  Perl 
program  written  for  this  research.  These  highly  homofogous  regions  (HHRs)  were 
compared  to  the  Fugu  5UR  using  Pairwise  BLAST  and.  analyzed  for  potential 
transcription  factor  binding  sites  and  other  promoter-associated  sequences  with 
MATCH,  Matlnspector,  Eurkaryotic  Promoter  Database  and  TRRD. 

Three  HHRs  were  discovered  in  the  NF1  5UR.  HHR1 ,  located  42626-42696  bp 
upstream  of  translation  start  site,  contains  an  AP-1  site  shared  by  all  four  species. 
HHR2,  located  640-689  bp  upstream  of  translation  start  site,  has  no  promising 
predictions  for  recognized  transcription  factor  binding  sites.  HHR3,  located  233-519  bp 
upstream  of  the  NF1  translation  start  site,  contains  a  previously-described  CREB  site 
that  is  shared  by  all  three  mammalian  species. 

HHR3  also  includes  a  24  bp  sequence  310-333  bp  upstream  of  the  translation  start 
that  is  identical  in  human,  mouse  and  rat  and  differs  by  onlyl  bp  in  Fugu.  Bioinformatic 
analysis  and  correlation  with  previously-published  in  vitro  transcription  studies  indicate 
that  this  sequence,  which  we  call  NF1  Highly  Conserved  Sequence  (NF1HCS),  is  likely 
to  be  involved  in  transcriptional  regulation.  NF1HCS  lies  151  bp  downstream  from  the 
NF1  major  transcriptional  start  site  but  appears  to  be  a  strong  candidate  for  the  NF1 
core  promoter  element  despite  its  position  further  downstream  than  any 
previously-described  eukaryotic  downstream  core  promoter  element. 


21 


* 


Ramsden  RT,  Evans  DGR,  Wallace  AJ,  Joe  H,  Baser  ME. 

Revised  diagnostic  criteria  for  neurofibromatosis  2. 

(Presented  at  the  53rd  Annual  Meeting,  American  Society  of  Human  Genetics,  4-8 
November  2003,  Los  Angeles,  CA) 

We  reported  that  each  of  the  four  sets  of  clinical  diagnostic  criteria  for  neurofibromatosis 
2  (NF2)  had  low  sensitivity  at  the  time  of  the  initial  assessment  (Neurology 
2002;59:1579-1565).  The  purpose  of  this  study  was  to  determine  the  extent  to  which 
modifications  to  the  Manchester  diagnostic  criteria  increased  sensitivity.  The  study  had 
221  NF2  patients  in  the  United  Kingdom  NF2  registry  who  presented  without  bilateral 
vestibular  schwannomas  (155  people  who  did  not  have  a  family  history  of  NF2  at  initial 
assessment  and  66  inherited  cases).  The  modifications  were:  (1 )  in  people  without  a 
family  history  of  NF2,  permitting  the  diagnosis  when  there  are  multiple  meningiomas 
and  only  one,  instead  of  two,  other  tumors  or  cataract  (as  in  the  NNFF  criteria);  in 
people  with  a  1st  degree  relative  with  NF2,  permitting  the  diagnosis  when  there  is  only 
one,  instead  of  two,  tumors  or  cataract  (as  in  the  1991  NIH  criteria),  but  restricting  1st 
degree  relatives  to  parents,  (2)  adding  juvenile  mononeuropathy  (<  1 5  years)  as  a 
diagnostic  criterion,  (3)  in  addition  to  clinical  criteria,  permitting  the  diagnosis  when 
constitutional  NF2  mutations  are  identified.  We  used  Kaplan-Meier  analysis  to 
determine  the  time  course,  from  initial  assessment  to  the  most  recent  clinical  evaluation, 
of  the  increasing  proportion  of  people  who  would  be  diagnosed  with  NF2  using  the 
Manchester  criteria  and  the  three  modifications;  the  jackknife  method  was  used  to 
compute  pointwise  standard  errors  for  differences  in  proportions  of  pairwise  Kaplan- 
Meier  curves  between  different  sets  of  criteria.  In  people  without  a  family  history  of  NF2 
at  initial  assessment  (the  most  difficult  group  to  diagnose),  sensitivity  was  increased  by 
incorporating  features  of  the  1991  NIH  criteria  and  the  NNFF  criteria  (modification  1 
above).  The  modified  Manchester  criteria  were  significantly  more  sensitive  than  the 
original  Manchester  criteria  from  four  years  after  initial  assessment  to  17  years  after 
initial  assessment.  In  inherited  cases,  sensitivity  was  further  increased  by  adding 
mononeuropathy  as  a  diagnostic  criterion  and  incorporating  the  positive  results  of 
mutation  analysis.  These  results  indicate  that,  in  NF2  patients  who  present  without 
bilateral  vestibular  schwannomas,  modifications  to  the  Manchester  criteria  can  increase 
diagnostic  sensitivity,  although  not  at  the  time  of  the  initial  assessment. 
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Tucker  T,  Wolkenstein  P,  Friedman  JM. 

Association  Between  the  Occurrence  of  Malignant  Peripheral  Nerve  Sheath  Tumours 
and  Subcutaneous  and  Internal  Neurofibromas  in  NF1 

Department  of  Medical  Genetics,  University  of  British  Columbia,  Vancouver,  Canada; 
and  Department  of  Dermatology,  Hopital  Henri-Mondor 

(Presentation  to  10th  European  Neurofibromatosis  Meeting,  Turku,  Finland,  Aug.,  2003) 

Individuals  with  NF1  have  a  lifetime  risk  of  about  10%  of  developing  a  malignant 
peripheral  nerve  sheath  tumour  (MPNST).  Such  tumours  are  one  of  the  most  frequent 
causes  of  death  among  people  with  NF1 .  Clinical  experience  suggests  that  most 
MPNSTs  develop  from  pre-existing  plexiform  neurofibromas.  The  number,  size,  and 
type  of  benign  neurofibromas  vary  greatly  among  NF1  patients,  but  it  is  not  known  if  an 
individual's  risk  for  developing  MPNST  bears  any  relationship  to  the  burden  of  benign 
neurofibromas. 

The  Henri-Mondor  Database  contains  information  on  516  probands  with  NF1  diagnosed 
by  the  NIH  Criteria.  The  average  age  was  35.0,  and  64%  were  between  10  and  39 
years  of  age.  44%  were  males  and  56%  females.  A  semi-quantitative  estimate  of  the 
number  of  cutaneous  neurofibromas  and  information  on  the  presence  or  absence  of  1 
or  more  subcutaneous  neurofibromas,  internal  neurofibromas  and  MPNSTs  were 
recorded  for  each  patient.  Internal  tumours  where  identified  by  screening  the  abdomen 
by  ultrasound,  CT  or  MRI  examination  and  by  screening  the  thorax  by  X-ray,  CT  or  MRI 
examination.  MPNSTs  were  diagnosed  by  biopsy.  We  used  logistic  regression  to 
determine  the  association  between  each  of  these  features;  age  was  treated  as  a 
covariante.  7.6%  of  these  patients  had  a  MPNST,  28%  had  one  or  more  internal 
plexiform  neurofibromas,  and  48%  had  one  or  more  subcutaneous  neurofibromas.  The 
distribution  of  cutaneous  neurofibromas  was  17%  with  10  or  fewer,  38%  with  11-100, 
and  31%  with  more  than  100.  MPNSTs  were  14.5  times  (95%  confidence  interval  [Cl] 
2.7-79.0)  more  likely  to  be  present  in  individuals  with  internal  plexiform  neurofibromas 
than  in  patients  without  internal  neurofibromas.  Individuals  with  subcutaneous 
neurofibromas  were  5.3  (Cl  2.1 -13.8)  times  as  likely  to  have  internal  plexiform 
neurofibromas  and  3.4  (Cl  1 .3-8.8)  times  as  likely  to  have  a  MPNST  as  patients  without 
subcutaneous  neurofibromas. 

No  association  was  observed  between  the  estimated  number  of  cutaneous 
neurofibromas  and  the  presence  of  subcutaneous  neurofibromas,  internal 
neurofibromas,  or  MPNSTs.  The  observation  that  MPNSTs  are  associated  with 
subcutaneous  neurofibromas  and  strongly  associated  with  internal  neurofibromas 
suggests  that  individuals  with  such  benign  tumours  may  warrant  increased  surveillance 
for  MPNST. 

We  are  grateful  to  the  Henri-Mondor  Database  Participants  who  contributed 
the  data  analyzed  in  this  study. 
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Composite  Likelihood  Estimation  in  Familial  Data  Analysis 

(Presentation  to  Biometrics  (WNAR)  meeting  in  Albuquerque,  New  Mexico:  June 

26-June  30,  2004) 

We  propose  two  estimation  procedures  based  on  composite  likelihoods  for  multivariate 
models  for  which  likelihood  calculations  are  too  time-consuming.  We  especially  focus 
on  two  classes  of  models  for  familial  data:  the  multinormal  random  effects  models  and 
the  multinormal  copula  models.  The  parameters  in  the  models  under  consideration  can 
be  classified  as  univariate  marginal  parameters  and  dependence  parameters.  The  first 
method  is  a  two-stage  method  in  which  the  univariate  marginal  parameters  are 
estimated  based  on  the  sum  of  log  likelihoods  of  univariate  margins  and  the 
dependence  parameters  are  estimated  separately  based  on  the  sum  of  log  likelihoods 
of  bivariate  margins  with  the  univariate  marginal  parameters  replaced  by  their 
estimates.  In  the  second  method,  all  the  parameters  are  estimated  from  the  weighted 
sum  of  log  likelihoods  of  bivariate  margins.  The  composite  likelihood  methods  can 
greatly  reduce  computation  in  parameter  estimation,  but  with  a  price  of  efficiency  loss. 
For  some  special  cases,  we  compared  the  asymptotic  efficiency  of  these  two  methods 
with  the  maximum  likelihood  method.  We  found  that  the  performance  of  the  two 
methods  is  reasonable,  except  that  when  the  dependence  is  strong,  the  first  approach  is 
inefficient  for  the  regression  parameters.  We  also  find  that  the  second  approach  is 
generally  better  for  the  regression  parameters,  but  less  efficient  for  the  dependence 
parameters  when  the  dependence  is  weak. 
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Computations  for  the  familial  analysis  of  binary  traits 

Harry  Joe  and  A.  H.  M.  Mahbub-ul  Latif 
Department  of  Statistics,  University  of  British  Columbia 
and  Abteilung  Medizische  Statistik,  Georg- August-Universitat  Gottingen 

Summary 

For  familial  aggregation  of  a  binary  trait,  one  method  that  has  been  used  is  the  GEE2  (generalized  esti¬ 
mating  equation)  method  corresponding  to  a  multivariate  logit  model  We  solve  the  complex  estimating 
equations  for  the  GEE2  method  using  an  automatic  differentiation  software  which  computes  the  derivatives 
of  a  function  numerically  using  the  chain  rule  of  the  calculus  repeatedly  on  the  elementary  operations  of  the 
function.  Based  on  this,  we  are  able  to  show  in  a  simulation  study  that  the  GEE2  estimates  are  quite  close  to 
the  maximum  likelihood  estimates  assuming  a  multivariate  logit  model,  and  that  GEE2  is  computationally 
faster  when  the  dimension  or  family  size  is  larger  than  four. 

Keywords:  binary  response,  multivariate  probit  and  logit  models,  generalized  estimating  equations,  familial 
aggregation 

1.  Introduction 

In  quantitative  genetics  and  epidemiology,  researchers  are  often  interested  in  identifying  important  variables 
or  traits  related  to  a  genetic  disease  and  also  in  familial  aggregation.  The  response  variables  measured  for 
the  disease  can  be  discrete  or  continuous.  In  this  paper,  we  focus  on  binary  traits,  such  as  presence/absence 
of  a  disease,  and  presence/absence  of  a  symptom/feature  of  a  genetic  disease.  For  familial  aggregation,  for 
genetic  hypotheses  one  would  like  to  know  the  strength  of  dependence  for  different  relationships  in  a  family 
such  as  parent-offspring,  sib-sib,  degree  2  relationship  (Falconer  1989).  Our  research  is  motivated  by  binary 
traits  in  the  genetic  disease  neurofibromatosis;  see  Szudek  et  al.  (2002). 

One  method  for  multivariate  binary  data  is  the  GEE2  odds  ratio  regression  or  multivariate  logit  model. 
Liang,  Zeger  and  Qaqish  (1992),  Molenberghs  and  Lesaffre  (1994),  Glonek  and  McCullagh  (1995)  and 
Joe  (1997)  considered  this  method/model  from  different  points  of  view.  GEE2  odds  ratio  regression  corre¬ 
sponds  to  the  multivariate  logit  model  with  a  multivariate  Plackett  distribution  (Molenberghs  and  Lesaffre 
1994;  Joe  1997).  The  formulation  of  Liang  and  Beaty  (1991),  and  Liang,  Zeger  and  Qaqish  (1992)  esti¬ 
mates  regression  and  dependence  (log  odds  ratio)  parameters  from  a  set  of  estimating  equations  without 
considering  whether  there  is  a  probability  model  behind  their  assumptions. 

The  multivariate  probit  model  (Ashford  and  Sowden  1970,  Mendell  and  Elston  1974)  is  motivated  for 
a  binary  trait  based  on  a  (latent)  polygenic  effect,  so  for  familial  analysis  of  a  binary  trait,  it  is  more  in- 
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terpretable  than  a  multivariate  logit  model.  Although  odds  ratios  have  a  convenient  interpretation,  there  is 
no  physical  or  stochastic  model  that  leads  to  the  odds  ratio  as  a  natural  dependence  parameter.  Joe  (1997) 
considers  the  multivariate  probit  and  multivariate  logit  models  as  multivariate  analogues  of  the  univariate 
probit  and  logit  latent  variable  family  of  models. 

In  this  paper,  we  compare  GEE2  and  maximum  likelihood  (ML)  estimates  assuming  a  multivariate  logit 
model.  The  computation  of  GEE2  estimates  includes  the  novel  use  of  automatic  differentiation  software  to 
solve  the  set  of  estimating  equations  which  are  not  simple  in  form.  The  previous  code  for  the  GEE2  method 
was  not  general  enough  for  binary  familial  data,  such  as  that  used  in  Szudek  et  al.  (2002),  that  are  not  in  the 
simpler  interclass/intraclass  format. 

The  organization  of  the  remainder  of  the  paper  is  as  follows.  The  models  are  specified  as  latent  vector 
models  in  Section  2.  Computational  details  are  given  in  Section  3  and  comparisons  are  made  in  Section  4. 
Section  5  concludes  with  a  discussion. 

2.  Models  and  methods  for  a  binary  trait 

For  a  binary  response  variable  Y,  with  covariate  vector  x,  common  statistical  methods  are  logistic  and  probit 
regression.  Both  of  these  methods  are  latent  variable  methods  with  the  probabilistic  representation  is: 

y  =  I(Z<a  +  j8'x),  (2.1) 

where  Z  is  standard  normal  (logistic)  for  the  probit  (logit)  regression  model.  The  model  (2.1)  can  be  written 
as 

Y  =  I(X>r ),  I^-Z  +  a  +  ^x  +  T  with  location  a-f/3'x  +  r 

where  X  is  the  liability  and  r  is  the  threshold.  One  can  apply  the  Central  Limit  Theorem  to  arrive  at  the 
probit  model  when  the  liability  is  influenced  by  the  additive  effects  of  many  genes.  The  logistic  density  is 
also  bell-shaped,  so  probabilistic  properties  of  logistic  and  probit  regression  are  similar. 

For  familial  data,  the  binary  response  vector  is  (Yi, . . . ,  Yd),  where  d  is  the  family  size.  For  a  model 
for  familial  aggregation  of  a  binary  trait,  one  needs  to  define  a  joint  probability  distribution  for  (Yi, . . . ,  Yd) 
for  d  >  2,  where  the  dependence  parameter  of  responses  (Yi,  Yj )  for  two  different  members  of  a  family, 
depends  on  the  relation  type. 

Probit  regression  model  easily  extends  to  the  multivariate  probit  model,  with  a  latent  multivariate  normal 
random  vector.  The  extension  of  logistic  regression  to  its  multivariate  counterpart  requires  a  way  to  define  a 
multivariate  logistic  distribution  with  suitable  dependence  parameters.  The  extensions  are  explained  below. 

The  multivariate  probit  model  has  been  known  for  a  long  time  (e.g.,  Ashford  and  Sowden  1 970,  Mendell 
and  Elston  1974).  The  stochastic  representation  of  the  model,  with  common  regression  parameters  for  each 
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margin,  is 


Yj  =  I(Zj  <  a  +  /3'xj),  j  =  1, . . .  ,d,  (Zi, . . . ,  Zd)  ~  iV(0,  i?d) 


(2.2) 


where  is  a  correlation  matrix.  For  models  for  familial  aggregation,  can  have  one  or  more  correlation 
parameters.  For  example,  for  the  exchangeable  dependence  model  there  is  a  single  correlation  parameter 
p;  for  a  model  for  nuclear  families,  there  are  three  parameters:  correlations  ppp:  pss  and  ppo  for  parent- 
parent,  sib-sib,  and  parent-offspring  respectively;  for  a  model  with  members  in  several  generations,  there 
may  be  further  correlation  parameters  for  second  and  higher  degree  relatives. 

For  logistic  regression,  the  regression  parameter  (3  for  a  binary  covariate  x  has  an  odds  ratio  interpre¬ 
tation.  For  a  pair  (Yi,>2)>  one  can  use  the  bivariate  Plackett  distribution  which  has  an  odds  ratio  as  a 
dependence  parameter: 

Pr(Yi  =  1,Y2  =  l;xi,x2)  Pr(yj  =Q,Y2  =  0;xi,x2) 


7  =  712 


(2.3) 


Pr(Yi  =  1,Y2  =  0;xi,x2)  Pr(Yi  =  0 ,Y2  =  l;xi,x2) 

for  all  xi,  x2.  Let  Fu(zi,  z2)  be  the  joint  distribution  of  the  latent  pair  {Z\.  Zz)  and  F(z)  =  (1  +  e-*)-1 
be  the  logistic  cumulative  distribution  function.  Then  (2.3)  is  the  same  as 

=  F 12 (ct  +  /3'xi,q  +  P'x2)  [1  -  F(a  +  /3'xi)  -  F(a  +  /3'x2)  +  F12(a  +  (3'xi,a  +  /3,x2) 

[F(a  +  |9'xi)-Fi2(a  +  /8,xi,a  +  /3,x2)][F(a  +  /3,x2)-Px2(Q!  +  )9,xi,a  +  /3'x2)] 

and  this  equation  can  be  solved  for  F\2(a  -f  /3'xi,  a  +  /3;X2). 

The  multivariate  Plackett  extension  is  given  in  Molenberghs  and  Lessaffe  (1994),  where  (2.3)  and  (2.4) 
are  extended  to  higher  orders;  for  example,  for  d  ~  3  dimensions,  with  7r(yi,  j/2>  Vs)  =  Pr(Yi  —  yllY2  = 

2/2,  *3  =  2/3;Xi,X2,X3), 

=  7r(l,l,l)7r(l,0,0)7r(0,l,0)7T(Q,Q;l) 

7123  7T(1, 1,  0)  7T(1,  0, 1)  7T(0, 1,  1)  7T(0,  0, 0)  •  >  •  ; 

For  the  d-dimensional  product  ratio,  there  are  2d~x  probabilities  each  in  the  numerator  and  denominator. 
Joe  (1997)  shows  that  these  ratios  do  not  lead  to  a  proper  multivariate  logistic  distribution  if  7123  and  higher 
order  7’s  are  close  to  0  or  large.  To  have  the  same  number  of  dependence  parameters  as  the  multivariate 
probit  model,  the  third  and  higher  order  7  parameters  are  taken  to  be  1  (see  Joe  1 997  for  a  maximum  entropy 
interpretation  in  this  case).  (2.5)  and  its  higher-order  equivalents  lead  to  roots  of  polynomials  that  must  be 
computed  to  obtain  Fi...^(a  +  /3'xi, . . . ,  a  +  /3'xd),  the  joint  distribution  of  the  latent  multivariate  logistic 
random  vector. 

To  obtain  maximum  likelihood  estimates  of  the  parameters  of  the  model  (2.2)  and  the  multivariate  logit 
counterpart,  multivariate  rectangle  probabilities  are  required,  since 

Pr(Yj  =  yu...,Yd  =  yd  |xi,...,xd)  =  Pt(Zi  -<yi  a  +  /3'xi,...,Zd  -O-d  a  +  /3'xd) 


where  is  <  if  yj  =  1  and  is  >  if  yj  —  0. 
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Liang  et  al.  (1992)  develop  a  method  called  odds  ratio  regression  or  GEE2  and  Liang  and  Beaty  (1991) 
apply  it  for  familial  aggregation  of  a  binary  trait.  They  do  not  assume  any  joint  distribution  for  (Yi,...,Yd) 
but  estimate  interclass  and  intraclass  odds  ratios  using  estimating  equations  that  generalize  method  of  mo¬ 
ments  equations.  These  estimating  equations  use  multivariate  Plackett  probabilities  for  dimension  2,  3,  and 
4.  Although  Liang  and  Beaty  (1991)  didn’t  mention  any  underlying  model  for  their  method,  their  method 
corresponds  to  an  estimation  method  for  the  multivariate  logit  model  that  is  different  from  maximum  likeli¬ 
hood.  We  will  express  their  estimating  equations  in  the  more  general  context  of  familial  data. 

Let  0  =  (/3,  0),  where  0  is  a  vector  of  log  odds  ratios,  with  different  odds  ratio  parameters  for  different 
relation  types  (similar  to  the  correlations  for  the  probit  model).  Let  =  (yn, . . . ,  yid.)  and  let  = 
( ynyi2 ,  •  •  * ,  V^di-iVidi),  i  =  1, . . . ,  with  n  families  and  di  members  in  the  zth  family.  Let  \x\  =  ^(/3)  = 
(mi, met,),  where  mj  =  E (%•),  and  r)\  =  f]\(Q)  =  (E [yam], . . . , E  [yi^-iVid,])-  The  estimating 
equations  have  the  form: 


(£_*)-  o, 


(2.6) 


where  £*(0)  is  the  covariance  matrix  of  (y*,  w*)  based  on  multivariate  Plackett  probabilities  up  to  dimension 
4. 


3.  Computational  implementations 

Conceptually,  the  models  in  the  previous  section  are  straightforward,  but  computations  for  the  maximum 
likelihood  and  GEE2  estimation  methods  are  not  straightforward.  Both  of  these  methods  require  iterative 
procedure  to  estimate  the  parameters. 

For  the  maximum  likelihood  estimator  (MLE)  of  the  multivariate  logit  model,  we  have  coded  the  com¬ 
putation  of  multivariate  Plackett  probabilities  by  recursively  finding  the  roots  of  many  polynomial  equations. 
The  log-likelihood  can  then  be  coded  and  the  MLE  of  / 3  and  the  0  parameters  can  be  obtained  using  an  it¬ 
erative  quasi-Newton  method;  for  example,  the  method  in  Nash  (1990)  is  convenient  as  it  also  computes  the 
inverse  Hessian  (asymptotic  covariance  matrix)  at  the  MLE.  Because  of  the  recursions,  the  computational 
effort  for  maximum  likelihood  estimation  is  exponentially  increasing  in  the  dimension  or  family  size  d. 

The  GEE2  estimates  of  /3  and  0  are  a  solution  of  the  estimating  equations  (2.6)  and  the  Godambe 
information  matrix  (Godambe,  1991)  at  the  GEE2  estimates  is  used  as  a  asymptotic  covariance  matrix  of 
the  estimates.  To  solve  (2.6),  the  computer  program  of  Liang  and  Beaty  (1991)  and  Qaqish  et  al.  (1992) 
can  be  used  but  it  has  limitations  in  handling  familial  data  in  general  pedigree  form;  it  can  only  handle 
familial  data  in  which  each  pair  is  either  an  interclass  or  intraclass  pair.  This  code  was  written  in  Pascal 
and  not  easy  to  modify  even  after  conversion  to  C  with  p2c.  Therefore,  we  wrote  a  new  implementation  of 
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GEE2,  in  which  the  equations  were  coded  in  C++,  and  the  estimates  of  the  parameters  /3  and  ifc  are  obtained 
using  the  Newton-Raphson  method.  This  method  requires  the  computation  of  the  Hessian  matrix  at  each 
iteration  and  the  elements  of  Hessian  matrix  are  obtained  from  the  first  derivatives  of  the  equations  (2.6).  To 
avoid  differentiating  complex  estimating  equations  analytically,  we  have  used  an  automatic  differentiation 
software  to  compute  the  derivatives  of  the  estimating  equations. 

Automatic  differentiation  (www.autodiff.org)  is  a  relatively  new  procedure  to  compute  derivatives  of  a 
function  mechanically.  It  exploits  the  fact  that  every  function  is  executed  on  a  computer  as  a  sequence  of 
elementary  (+,—,*,••  •)  and  transcendental  (log,  sine,  cosine,  . . .)  operations.  Using  the  chain  rule  of 
differential  calculus  repeatedly  on  these  operations,  automatic  differentiation  computes  the  derivatives  of 
any  user  provided  function.  The  main  advantages  of  automatic  differentiation  over  the  existing  methods  of 
numerical  differentiation  are:  (i)  it  is  truncation  error  free  and  accurate  to  the  working  precision,  (ii)  can  be 
incorporated  with  common  high  level  programming  languages.  A  number  of  automatic  differentiation  soft¬ 
ware,  written  in  different  programming  languages  such  as  FORTRAN,  C/C++,  etc.,  are  available  (Bischof 
et  al.,  1997)  and  are  widely  used  in  many  fields  which  require  numerical  optimization  such  as  engineering, 
oceanography,  etc.  It  can  be  very  useful  tool  in  Statistics  for  obtaining  numerical  derivatives  of  complex 
log-likelihood  functions  or  solving  nonlinear  systems  of  equations  which  are  not  analytically  easy  to  work 
with. 

For  our  implementation  of  GEE2,  we  have  used  the  automatic  differentiation  software  FADBAD  (Bendt- 
sen  and  Stauning  1996)  which  is  written  in  C++  and  found  that  this  is  faster  than  maximum  likelihood  for 
family  sizes  of  5  or  more,  even  with  the  C++  overhead  in  automatic  differentiation.  This  is  expected  be¬ 
cause  GEE2  only  requires  multivariate  Plackett  probabilities  in  dimensions  4  or  less.  Even  with  compiled 
programs  in  C/C++,  the  computational  time  will  be  of  the  order  of  minutes  on  fast  Pentium  computers  if 
there  are  many  families  of  size  6  or  more. 

Our  program  is  written  in  a  form  that  allows  the  user  to  specify  general  relation  classes  (see  examples  in 
Section  2)  and  there  is  a  dependence  parameter  (odds  ratio)  for  each  relation  class.  For  the  simpler  use  of  the 
programs,  the  dependence  parameters  are  not  functions  of  covariates.  This  is  mainly  due  to  mathematical 
or  probabilistic  consistency  of  the  models;  there  is  no  known  way  of  making  the  odds  ratio  dependence 
parameters  be  functions  of  a  covariate  x  so  that  the  resulting  set  of  odds  ratios  are  compatible  for  all  x.  For 
a  categorical  covariate  x ,  one  could  split  the  data  into  groups  for  estimates  of  the  parameters  or  form  extra 
relation  classes.  For  example,  if  the  gender  of  the  parent  might  be  a  factor,  one  could  use  father-offspring 
and  mother-offspring  relation  classes  in  place  of  the  parent-offspring  relation  class. 
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4.  Comparisons  of  the  models/methods 


Latif  (2001)  has  a  simulation  study  to  compare  maximum  likelihood  and  GEE2  estimates  for  the  multi¬ 
variate  logistic  models  which  considers  different  predigrees,  different  number  of  families,  simulations.  The 
simulation  results  are  similar  in  different  cases,  so  in  this  paper  we  discuss  only  one  of  the  simulations.  This 
specific  simulation  has  500  simulations  with  200  families,  each  family  has  five  members  of  three  genera¬ 
tions:  one  grandparent,  one  parent,  one  uncle/aunt,  two  grandchildren. 

The  binary  familial  data  are  simulated  from  multivariate  probit  model  with  one  covariate  age.  [Note  that 
simulation  from  the  multivariate  logit  model  is  much  more  difficult,  since  one  cannot  simulate  the  latent  lo¬ 
gistic  variables  easily  because  of  the  implicit  equations  defining  the  multivariate  distribution  functions.]  The 
standard  deviation  of  the  standard  logistic  distribution  is  7r/\/3  =  1.81,  so  for  the  same  data,  the  regression 
coefficients  of  logistic  regression  are  usually  roughly  1.8  times  the  corresponding  regression  coefficients 
of  probit  regression.  For  two  binary  variables  based  on  a  latent  bivariate  standard  normal  distribution  with 
correlation  p,  the  odds  ratio  depends  on  the  cut-off  points,  but  is  bounded  by 


1  +  (2/7t)  arcsin  p  V 2 
l  —  (2/7r)arcsinpJ 


The  column  2  of  Table  I  shows  the  true  parameters  used  in  multivariate  probit  model  to  simulate 
multivariate  binary  data;  two  different  sets  of  parameters  are  considered.  The  columns  3-4  show  the 
ML  and  GEE2  estimates  of  the  multivariate  logistic  model.  As  expected  the  estimates  of  the  regression 
coefficient  is  about  1.81  times  of  the  corresponding  parameter  used  in  multivariate  probit  model,  e.g. 
0.2  x  1.81  =  0.362  «  0.37.  The  estimates  of  the  dependence  parameters  (log  odds  ratios)  correspond 
roughly  to  the  upper  limit  shown  in  (4.1),  e.g.  logB(0.4)  =  1.1,  logl?(0.6)  =  1.7,  log£(0.8)  =  2.7. 
The  ML  and  GEE2  estimates  were  often  the  same  to  2  or  3  significant  digits.  The  standard  deviation  of 
the  parameter  estimates  and  the  average  standard  errors  in  each  line  are  roughly  the  same.  That  means, 
our  study  show  that  the  ML  and  GEE2  estimates  of  regression  and  dependence  parameters  of  multivariate 
logistic  distribution  are  similar. 


5.  Discussion 


The  multivariate  logit  model  and  GEE2  estimation  method  are  an  alternative  to  the  multivariate  probit  model 
for  binary  trait  data,  and  may  be  useful  to  apply  for  a  sensitivity  analysis.  Liang  and  Beaty  (1991)  mention 
that  their  odds  ratio  regression  model  is  a  more  general  model  and  avoids  the  unobservable  continuous 
trait  of  the  multivariate  probit  model.  However  we  have  shown  that  the  probabilistic  assumptions  behind 
their  method  are  conceptually  very  close  to  that  of  the  multivariate  probit  model  with  latent  logistic  random 
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variables  in  place  of  latent  normal  random  variables.  Our  simulations  also  show  that  the  GEE2  and  ML 
estimates  of  multivariate  logistic  model  are  similar. 

We  have  also  demonstrated  the  usefulness  of  automatic  differentiation  for  the  GEE2  equations.  In  gen¬ 
eral,  automatic  differentiation  can  be  a  useful  tool  in  Statistics  when  there  are  complicated  nonlinear  func¬ 
tions  to  be  solved. 
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Table  I:  Simulation  data  from  multivariate  probit:  average  of  estimates,  average  absolute  differences  (ML 
and  GEE2)  and  average  SEs  assuming  a  multivariate  logit  model. 
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ABSTRACT 

We  propose  two  estimation  procedures  based  on  composite  likelihoods  for  multivariate 
models  with  regression/univariate  and  dependence  parameters  for  which  likelihood  calcula¬ 
tions  are  too  time-consuming.  Examples  are  models  in  familial  data  analysis  with  discrete 
or  censored  response.  The  first  method  is  a  two-stage  method  in  which  the  univariate  pa¬ 
rameters  are  estimated  based  on  the  sum  of  log  likelihoods  of  univariate  margins  and  the 
dependence  parameters  are  estimated  separately  based  on  the  sum  of  log  likelihoods  of  bi¬ 
variate  margins  with  the  univariate  parameters  replaced  by  their  estimates  from  the  first 
stage.  In  the  second  method,  all  the  parameters  are  estimated  from  the  weighted  sum  of 
log  likelihoods  of  bivariate  margins.  The  composite  likelihood  methods  can  greatly  reduce 
computation  in  parameter  estimation,  but  with  a  price  of  efficiency  loss.  For  some  spe¬ 
cial  cases,  we  compare  the  asymptotic  efficiency  of  these  two  methods  with  the  maximum 
likelihood  method.  Our  investigation  includes  the  multivariate  normal  (MVN)  model  for  a 
continuous  variable,  the  multivariate  probit  model  for  a  binary  variable  and  the  MVN  for 
a  right-censored  variable.  We  find  that  the  performance  of  the  two  methods  is  reasonable, 
except  that  when  the  dependence  is  strong,  the  first  approach  is  inefficient  for  the  regression 
parameters.  We  also  find  that  the  second  approach  is  generally  better  for  the  regression 
parameters,  but  less  efficient  for  the  dependence  parameters  when  the  dependence  is  weak. 

1  Introduction 

For  models  used  to  analyze  multivariate  data,  we  consider  estimation  methods  based  on 
log-likelihoods  of  low-dimensional  margins  when  the  multivariate  probabilities  are  too  time- 
consuming  to  compute  but  low-dimensional  margins  probabilities  are  computable.  The  class 
of  models  we  are  considering  have  parameters  that  are  common  to  different  margins,  for 
example,  common  regression  parameters  for  different  univariate  margins,  and  common  de¬ 
pendence  parameters  for  different  bivariate  margins.  In  this  paper,  we  use  familial  data  as 
our  example  to  illustrate  the  estimation  methods. 
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A  specific  context  of  familial  data  is  as  follows.  The  sample  is  formed  by  families  with 
varying  sizes.  We  denote  the  number  of  families  by  n  and  the  number  of  members  in  the  ith 
family  by  fc,.  There  is  a  single  response  measurement  on  each  individual.  This  response  may 
be  a  continuous  variable,  an  indicator  of  some  symptom,  the  right  censored  onset  time  of  a 
disease  feature,  or  some  other  type  of  variable.  We  denote  the  response  measurements  of  the 
ith.  family  by  Yj  =  (Yu, . . . ,  Y^)' .  Often,  there  are  also  a  set  of  covariates  at  the  individual 
level,  which  we  denote  by  X*  =  (xiX, . . . ,  xifci)',  a  ki  x  m  matrix. 

For  a  continuous  response,  a  common  model  is  the  multivariate  normal  model  (Srivastava 
and  Ng,  1990).  The  correlation  matrix  for  a  family  depends  on  the  family  structure.  For 
a  binary  response,  a  common  model  is  the  multivariate  probit  model  (MVP)  with  latent 
multivariate  normal  variables  (Mendell  and  Elston,  1974).  Again,  the  correlation  matrix 
depends  on  the  family  structure.  In  general,  the  multinormal  random  effects  models  and 
the  multinormal  copula  models  introduced  in  Section  2  can  be  used  to  analyze  various  types 
of  familial  data.  In  these  models,  there  are  regression  parameters  which  are  univariate 
parameters,  and  correlation  (dependence)  parameters  such  as  the  sib-sib  or  parent-offspring 
correlations. 

In  many  cases,  for  instance,  the  case  of  binary  or  right-censored  responses,  the  joint 
likelihood  for  a  family  involves  multi-dimensional  integrals  which  become  more  and  more 
difficult  to  evaluate  when  the  family  size  increases.  However,  the  densities  of  the  univariate 
and  bivariate  margins  can  be  easily  calculated.  All  the  parameters  of  the  models  considered  in 
this  paper  are  either  univariate  or  bivariate  parameters.  Hence  to  estimate  the  parameters, 
we  consider  adding  together  log  likelihoods  of  univariate  or  bivariate  margins  to  form  a 
composite  likelihood  (CL),  so  named  by  Lindsay  (1988).  The  ideas  extend  to  other  CLs 
when  there  are  parameters  that  belong  to  higher-order  margins.  Besides  the  computational 
advantage,  a  CL  also  inherits  some  properties  of  the  ordinary  likelihood.  In  particular,  under 
regularity  conditions,  the  estimates  based  on  CL  are  consistent  and  asymptotically  unbiased. 

The  following  are  some  recent  applications  of  CL  in  multivariate  analysis:  Xu  (1996)  pro¬ 
posed  using  inference  functions  formed  by  univariate  and  bivariate  marginal  likelihoods  for 
multivariate  discrete  data;  Heagerty  and  Lele  (1998)  and  Curriero  and  Lele  (1999)  considered 
pairwise  CL  estimation  for  binary  spatial  data;  Parner  (2001)  used  pairwise  likelihood  con¬ 
tributions  to  analyze  familial  survival  data;  Joreskog  and  Moustaki  (2001)  compared  a  CL 
approach  with  two  other  approaches  in  factor  analysis  of  ordinal  variables;  Lele  and  Taper 
(2002)  considered  a  CL  approach  to  estimate  the  variance  components  of  a  MVN  model. 

In  this  paper,  we  study  two  CL  estimation  methods.  One  is  based  on  the  CLs  of  both 
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the  univariate  margins  and  the  bivariate  margins,  and  estimates  the  univariate  and  depen¬ 
dence  parameters  in  two  steps.  The  other  only  uses  the  bivariate  CL  and  estimates  all  the 
parameters  simultaneously.  To  illustrate  the  main  ideas,  we  use  a  simple  case  of  the  MVP 
model  as  an  example.  Suppose  Yy  =  J.(Zy  >  0),  where  /  is  an  indicator  function  and  the 
latent  variable  Zy  has  a  normal  distribution  with  common  mean  p  and  variance  1.  Moreover, 
suppose  Cov(Zij,  Ziji)  —  p  for  all  i  and  j  ^  j'.  Then,  there  is  one  univariate  parameter  p, 
and  one  dependence  parameter  p.  In  the  first  approach,  we  estimate  p  by  maximizing  the 
function: 

=  ^  ^  ^  llilfijy  p)i 

i  j 

where  h(yij-fp)  is  the  univariate  log  likelihood  of  Yq.  Then,  we  estimate  p  by  maximizing 
the  function 

^2  (£>  P)  =  £>  p), 

i  j>jr 

with  respect  to  p,  where  P, p)  is  the  bivariate  log  likelihood  of  (Y^,Y^/)  and  p  is 

the  estimate  obtained  in  the  first  step. 

In  the  second  approach,  we  estimate  p  and  p  in  one  step  by  maximizing  the  function 

h(Vij,  Vij> ;  p,  p), 

i  j>j’ 

where  Wi  is  a  weight  depending  on  the  family  size.  The  motivation  and  choice  of  the  weights 
are  given  in  Section  4.2. 

The  above  ideas  work  for  models  with  covariates,  but  the  notation  is  more  complicated. 
General  classes  of  models  for  which  our  CL  methods  apply  are  given  in  Section  2.  Notation 
for  the  structure  of  the  parameters  in  these  models  is  given  in  Section  3.  In  Section  4,  we 
introduce  the  two  CL  estimation  procedures.  In  Section  5,  we  outline  the  jackknife  method  to 
estimate  the  covariance  matrix  of  the  CL  estimates.  In  Section  6,  we  examine  the  efficiency 
of  these  procedures  by  comparing  them  with  the  ML  method  in  some  special  cases,  for  which 
the  comparison  is  actually  possible.  In  Section  7,  we  apply  the  proposed  methods  to  a  data 
example.  Section  8  concludes  with  some  discussion  and  topics  of  further  research. 

2  Classes  of  models 

In  this  section,  we  list  two  classes  of  models  for  whilch  the  composite  likelihood  estimation 
method  would  be  useful. 
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Multinormal  Random  Effects  Model.  A  random  effects  model  has  a  hierarchical  struc¬ 
ture.  The  first  level  specifies  the  distribution  of  the  observed  response  vector  Y  conditional 
on  an  unobservable  random  vector  A  =  (Ai,  ...,  Ak)'.  Given  A  =  A  =  (Ai,  ...,  A*,)', 
we  assume  that  Yj,  . . . ,  Yk  are  conditionally  independent  with  probability  density  function 
(pdf)  ),  where  q,  denote  other  parameters  which  are  not  random.  The  second 

level  specifies  the  distribution  of  A.  We  assume  that  h( A)  =  Z  ~  N(fi,  S),  for  some  one-to- 
one  function  h.  Examples  include  the  Poisson-lognormal  mixture  model  (Aitchison  and  Ho, 
1989),  and  the  multivariate  lognormal  frailty  model  (Hougaard,  2000,  Chapter  10). 

Multinormal  Copula  Model.  Suppose  Y{  ~  Gh  i  =  1,...  ,k.  We  use  a  multinormal 
copula  to  construct  the  joint  distribution  of  Y : 

at y)  =  ■  ■  ■ .  *r'(G*(»)];  r), 

where  <E»fc  is  the  cumulative  distribution  function  (cdf)  of  fc-variate  normal  distribution  and 
R  is  a  fc-dimensional  correlation  matrix.  The  cdf  Gi  can  be  any  (parametric)  family  of 
univariate  distributions.  For  example,  for  count  data,  Gi  may  be  the  Poisson  or  negative 
binomial  distribution,  and  for  survival  data,  Gi  may  be  the  Weibull  distribution. 

3  Notation 

In  this  section,  we  indicate  the  notation  that  will  be  used  for  developing  the  CL  methods. 

Let  Fki,  fki  and  lki  denote  respectively  the  cdf,  pdf  and  log  likelihood  function  of  Yj. 
We  impose  the  following  distributional  assumptions  on  Yj.  Suppose  Y^-  has  univariate  cdf 
F\{- ;  Vij,~f).  The  parameter  Vij  depends  on  covariates  through  a  function  rj,  i.e.,  i/y  = 
7/(Xy,/3);  (3  is  a  vector  of  regression  coefficients  when  rj  is  a  transform  of  a  linear  function 
of  x.  The  parameter  7  does  not  depend  on  covariates.  In  general,  both  v  and  7  could  be 
vectors.  Let  0\  =  (/3/,7/)'  denote  the  parameter  vector  common  to  YZj.  Next,  suppose  for 
j  7^  j'  that  (Yij,Yij>)  has  bivariate  cdf  • ;  5ijj>)  where  5^  is  a  function  of  a  vector  of 
dependence  parameters  denoted  by  02  Typically  is  a  single  element  of  02  which  depends 
on  the  relation  of  individuals  j  and  /  in  family  i.  We  further  assume  that  the  joint  cdf  of  Y*, 
Fk.  is  fully  specified  by  the  univariate  and  bivariate  parameters,  9  =  (9[,  0'2)'  =  (/3',7,,0,2)'. 

The  above  notation  covers  the  classes  models  from  Section  2.  To  illustrate  the  notation, 
we  give  two  examples  below:  multivariate  normal  and  multivariate  probit.  Another  example 
is  the  multivariate  Poisson  log-normal  model  for  multivariate  count  data. 
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Example  1  MVN  model.  Fi(-;  ^,7)  is  the  N(fi,a2)  distribution  with  v  =  fi  =  x'/3,  7  =  a2. 
For  an  exchangeable  dependence  model,  Corr(Y,j,  Yiy )  =  8ijy  =  a  for  j  >  j',  and  02  =  a.  For 
an  example  of  a  non-exchangeable  dependence  model,  suppose  k%  =  3,  Y*  =  (Ya,  Yi2,Yay, 
and  8m  =  <5i3i  =  &i  and  Si3 2  =  a2.  Then,  d2  =  (a\,  a2)'. 

Example  2  MVP  model.  Fi(-;  ^,7)  is  the  Bernoulli(d>(z/))  distribution  with  v  —  x'(3  and 
7  null.  For  an  exchangeable  dependence  model,  suppose  the  correlation  among  the  latent 
variables  is  a  constant  a.  Then  02  =  a.  Similar  to  Example  1,  one  can  have  non-exchangeable 
dependence. 


4  Estimation  Procedures  based  on  Composite  Likelihood 

The  first  and  second  subsections  respectively  contain  the  two-stage  and  one-stage  CL  meth¬ 
ods  based  on  univariate  and  bivariate  margins.  The  third  subsection  contains  some  asymp¬ 
totic  results  of  the  two  methods. 


4.1  Two-stage  Estimation  Procedure  (CL1) 

We  estimate  9\  =  by  maximizing  the  CL  function  of  univariate  margins  (UCL): 


EE 


i=i  j= 1 


where  z/^-  =  r/(xy,/3).  Note  that  ^ucl  is  a  log  likelihood  if  Yn,---  , Y,fci  are  independent 
for  each  i.  Differentiating  ^vcl  with  respect  to  9\  leads  to  the  following  composite  score 
function: 

/  dhQfiif'ijfl)  4  dvjj  \ 

fn  \  I  du  *  83  \  /i  \ 


Vh.OLl  (^1) 


H  du  dp 

i  Q'y 


With  6 1  fixed  as  the  estimate  from  ^ucLi  we  estimate  02  by  maximizing  the  CL  function 
of  bivariate  margins  (BCL),  which  is  the  summation  of  the  log  likelihoods  of  pairs  ( Y,j ,  Yiy), 


3  >  3  ■ 


^BCl{G  1,  ^2)  =  Y/  53  Viji 


i=l  j>? 


The  corresponding  estimating  function  is 


^2) 


dl2 (Yjj ,  % ;  8jjf ,  z/jj ,  z^j/ ,  7)  dS^y 

r)Rn  ~  1^2-^ 
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Let  V'cli  =  V,2,CLl),•  The  estimate  of  0 ,  denoted  by  Oclu  is  the  solution  of 

V’cli(^)  =  0-  Computationally,  this  procedure  can  be  implemented  in  two  steps: 

Step  1:  estimate  0\  by  0itcLU  a  solution  of  V,i,cli(^i)  =  0  (usually  equivalent  to  maximizing 
^ucl{^  i))- 

Step  2:  estimate  02  by  02,cli,  a  solution  of  '02,cli(^i,cz,i)  #2)  =  0  (usually  equivalent  to 
maximizing  4/bc'l(0i,cli,  ^2))- 

4.2  One-stage  Bivariate  Composite  Likelihood  Estimation  Procedure  (CL2) 

An  alternative  to  the  two-stage  method  is  to  estimate  the  two  sets  of  parameters  simulta¬ 
neously  by  maximizing  the  BCL  function  bcl  defined  in  (2).  However,  this  method  could 
overemphasize  large  families  (for  univariate  parameters)  since  they  contribute  quadratically 
increasing  number  of  pairs  to  the  BCL.  It  is  sensible  to  weight  the  contribution  of  each  family 
according  to  its  size.  An  individual  from  a  family  of  size  k  appears  in  k  —  1  pairs.  Therefore 
we  weight  this  contribution  of  the  family  by  l/(fc  —  1).  When  Yu,...,  Y ^  are  independent, 

EWW  =  '52MYij)+l1{Yir))  =  (h  -  1)5>(^). 

j>j'  j>?  •  j 

After  weighted  by  l/(ki  —  1),  the  CL  function  is  the  log  likelihood  function  of  Yj. 

For  those  families  with  one  member,  we  set  the  weight  as  1  and  use  the  univariate  log- 
likelihood.  Thus,  the  estimating  function  of  the  CL2  method  is 

n 

T CL2{0 )  =  (0), 

2=1 

where  w{  =  1,  i(i)(0)  =  h(Ya]  0)  when  k,  =  1,  and  w,  =  1/(A*-1),  %)(#)  =  h(Yij,  Yif;  0) 
otherwise.  It  yields  the  following  estimating  function 

W»)  =  =£»*(»), 

i 

where  hj(0)  =  dl^)(6)/d0. 

4.3  Asymptotic  Properties 

Theorem  1  states  the  general  asymptotic  properties  that  cover  the  two  CL  methods. 

Let  ip(n)(0)  =  V’i(^)  be  the  inference  function  generated  from  the  CL  method,  where 

ipi(0)  is  the  contribution  of  family  i.  Let  0n  be  a  solution  of  ip^(0)  =  0.  Under  similar 
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regularity  conditions  applied  to  the  MLE  (Serfling,  1980),  E^ipfO)  =  0  (Lindsay,  1988), 
therefore  is  unbiased. 


Theorem  1  Assume  that  the  size  of  the  largest  family  is  bounded  and  the  sample  can  be 
considered  as  a  finite  mixture  of  family  structures.  Under  the  usual  regularity  conditions  on 
the  log-likelihood  of  univariate  and  bivariate  margins ,  as  n  — *  oo, 

y/H(dn-0)±N(0,G?(0))t 

where  Gy,{0)  is  the  Godambe  information  matrix  defined  as: 

=  [jta  EM#)]'  [Bm  M»(0)]  ‘  [lta  D„(»)]  , 


(4) 


with 


»♦(«)  =  “E*  and  M*(«)  =  ±E ',{^(8)^(9)}. 


This  follows  from  the  usual  Taylor  expansion  to  second  order;  see  Godambe  (1991). 

The  covariance  matrix  of  6n  is  approximated  by  V  =  n_1D^1M^(D^1)/,  provided  D^1 
exists. 

For  the  CL1  method,  let  D1)H/  =  -E{dipltCLl/d9v)  and  MhU,  =  E(^JiC£1^,fC£1),  /,/'  = 
1,2.  Then, 

1  /Di,u  0  \  /Min  Mi,i2\ 

=  T>cli  =  —  I  I  and  =  MCli  =  —  I  I  (5) 

71  \Dl,21  Di>22/  n  \Mi_2i  M122 / 

it  follows  that 

Var(0i ,cli)  «  D^J1M1ih(D^J1)/, 

Var(»2,0I1)  »  (-D&D^Drj,  »«.)  Mai  ( ^  , 

— ^1,22^1,21^1,11. 


Cov(0i|CLl,  ^2,CLl)  ~  (Diji  o)  Mcli 


ori 


22 


For  the  CL2  method. 


=  DCi2  =  --  WiE  |  —Tjjp- 1  and  M^,  =  MC£2  =  “  ^  wi E  {h*(0)h'i 


(*)}■ 
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Theorems  2  and  3  show  respectively  the  asymptotic  properties  of  the  CL1  and  CL2  esti¬ 
mators  when  Yu, . . . ,  Yiki  are  independent  for  each  i.  The  theorems  are  proved  by  comparing 
the  Godambe  information  matrix  of  xpCL  1  or  ipCL2  with  average  Fisher’s  information  matrix: 

F(0)  =  lim  -  VE 

n— >oo  77,  ^ 

i 

The  proofs  are  straightforward  but  involves  analyzing  submatrices  of  the  information  matri¬ 
ces;  they  are  given  in  Zhao  (2004). 

Theorem  2  Assume  the  conditions  in  Theorem  1.  LetGCLi{0)  be  the  Godambe  information 
matrix  defined  in  (f)  with  Dy,  and  Mg,  as  in  (5)  and  F(0)  be  Fisher’s  information  matrix 
defined  in  (6).  Suppose  0O  =  (^o, 1^0,2)'  the  true  value  of  6.  When  Yu,  ...  Y^.  are 
independent  for  each  i,  i.e.  fki( y^o)  =  EL;  fiiVij,  0o,i),  we  have  (a) 

Gcli(0o)=F(0o) 

when  0o,2  is  in  the  interior  of  the  parameter  space;  and  (b ) 

lim  GCI/1(0O)i,02)  =  lim  F(0o>i,02) 

6  2“ *00,2  O2 — *00, 2 

when  0O)2  is  on  the  boundary  of  the  parameter  space  and  F (0)  and  GCli(6)  are  continuous 
at  00,2  • 

Theorem  3  Assume  the  conditions  in  Theorem  1.  Under  independence,  i.e.,  /fc(y;0o)  = 
rij/i(%;0o,i),  we  have 

AVar  (0i,c£2)  =  AVar  (0i,ml£i)> 

AVar(02)cL2)  —  AVar (02,mle)  is  non-negative  definite, 

where  AVar  denotes  asymptotic  covariance  matrix.  The  relations  hold  for  the  limits  of  the 
covariance  matrices  if  0o)2  is  on  the  boundary  of  the  parameter  space. 

Theorem  2  suggests  that  under  independence  the  CL1  approach  is  asymptotically  equivalent 
to  the  ML  method.  Theorem  3  suggests  that  under  independence  the  CL2  estimate  of  0i  is 
asymptotically  as  efficient  as  the  MLE  whereas  that  of  02  is  less  efficient  than  the  MLE. 


\diki(Yhe)] 

dlki  (Yj,  0)1 

dd 

00 

(6) 
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5  Jackknife  Estimate  of  the  Asymptotic  Covariance  Matrix  of  6 

Here,  8  denotes  either  the  CL1  or  the  CL2  estimate  of  0.  Different  methods  can  be  con¬ 
sidered  to  estimate  the  asymptotic  covariance  matrix  of  6.  One  approach  is  to  evaluate  the 
Godambe  information  matrix  at  8  analytically.  This  can  be  time-consuming  or  even  some¬ 
times  impossible.  Another  approach  is  the  jackknife  with  one  family  removed  at  a  time.  Let 
0-i  be  the  estimator  of  8  with  Yj  deleted.  The  jackknife  estimator  of  the  covariance  matrix 
of  8  is 

n 

v,  =  £(*_< -*)(*_, -0)'. 

i 

A  proof  of  consistency  of  this  estimator  for  the  i.i.d.  case  and  the  case  with  covariates  is 
given  by  Joe  (1997)  (Chapter  10).  The  theorem  can  be  directly  applied  to  familial  data  with 
fixed  family  structure.  Normally,  family  structures  are  not  identical,  but  we  assume  that  the 
population  under  consideration  is  a  finite  mixture  of  family  structures.  Since  Joe’s  results 
hold  for  each  family  structure,  and  the  estimating  function  is  a  summation  over  all  family 
structures,  the  consistency  can  be  generalized  from  his  proof. 

For  large  samples,  the  jackknife  estimator  can  be  obtained  by  deleting  a  block  of  families 
at  a  time  in  order  to  reduce  the  computation  time. 

6  Efficiency  Comparisons 

The  CL  methods  are  considered  for  reasons  of  computational  ease.  Theoretically  they  are 
not  fully  efficient.  We  would  like  to  know  when  the  CL  estimates  tend  to  lose  more  efficiency 
and  how  much  the  efficiency  loss  is.  In  this  section,  we  offer  some  answers  to  these  questions. 
We  define  the  asymptotic  relative  efficiency  (ARE)  of  the  CL  estimate  as 

ARE(<9Cl)  =  AVar(@ML£)/AVar(0Ci), 

where  AVs,r(9MLE)  and  AVar (Qcl)  are  derived  from  the  inverses  of  Fisher’s  and  the  Godambe 
information  matrices  respectively. 

We  make  comparisons  for  three  types  of  variables:  continuous,  binary  and  survival  subject 
to  right  censoring.  For  each  type  of  data  we  select  one  distribution  as  a  representative:  the 
MVN  for  continuous,  the  MVP  for  binary  and  the  MVN  with  random  right  censoring  for 
survival.  For  each  model,  we  select  some  cases  in  which  the  comparisons  can  be  made.  With 
these  comparisons,  we  gain  some  insight  into  how  the  efficiency  is  affected  by  factors,  such 
as  degree  of  dependence,  family  size  and  data  type. 
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6.1  Models  and  Methods 
Covariate 

In  all  the  cases,  we  consider  one  covariate.  Let  x*  =  (a;^, . . .  ,xife.)  be  the  vector  of 
covariate  values  of  family  i.  To  simplify  the  comparisons,  we  assume  that  the  values  in 
Xj  are  independent  realizations  of  a  random  variable  with  mean  0  and  variance  1.  The 
assumptions  on  the  mean  and  the  variance  can  be  easily  satisfied  by  centering  and  scaling. 

Dependence  Structures 

One  dependence  structure  we  investigate  is  the  exchangeable  dependence  structure.  The 
other  one  is  the  dependence  structure  of  family  units  containing  one  parent  and  multiple 
offspring  (relevant  for  data  on  family  members  affected  with  an  autosomal  dominant  genetic 
disorder).  The  second  case  has  two  levels  of  dependence:  parent-offspring  and  sib-sib  asso¬ 
ciations.  We  also  consider  families  with  either  fixed  or  varying  family  sizes.  Results  of  these 
two  cases  can  give  us  some  idea  for  more  complicated  dependence  structures. 

Models 

We  list  the  representative  models  used  for  the  continuous,  binary  or  right-censored  famil¬ 
ial  data.  Under  each  model,  we  summarize  the  special  cases  chosen  to  indicate  some  general 
patterns  in  the  efficiencies  of  the  CL1  and  CL2  estimates. 

(1)  Continuous  response:  MVN  model 

In  this  model,  Y*  ~  N(Xif3,  <r2Ri(a)),  where  / 3  =  (f30,  Pi)1  and  Rj(a)  =  [pijf(oc)}  with 
a  = 

The  closed  form  of  the  likelihood  of  a  MVN  distribution  enables  us  to  derive  the  theo¬ 
retical  asymptotic  variances  of  both  the  ML  and  the  CL  estimates,  so  that  the  efficiency  can 
be  compared  analytically.  We  treat  the  covariate  as  a  random  variable  and  the  variances 
are  derived  based  on  its  asymptotic  properties.  The  ARE  of  fa  have  a  relatively  simple 
expression  and  allows  us  to  study  its  general  properties.  For  other  parameters,  we  calculate 
the  efficiencies  for  some  special  cases: 

Case  1  Exchangeable  non-negative  dependence. 

Case  2  One  parent  and  multiple  offspring.  The  number  of  offspring  varies  from  1  to  5  with 
equal  proportion.  The  parent-offspring  correlation  (pi)  and  the  sib-sib  correlation 
(p2)  need  to  satisfy:  k*p\  -  (k*  -  1  )p2  <  1  (Srivastava  and  Katapa,  1986),  where  k* 
is  the  maximum  number  of  offspring  in  a  family.  We  fix  the  value  of  pi  at  four  levels, 
(0,  0.3,  0.5  and  0.8),  while  vary  the  value  of  p2  within  its  full  range. 
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Case  3  One  parent  and  multiple  offspring  with  various  combinations  of  family  sizes.  We 
focus  on  studying  the  impact  of  the  distribution  of  family  sizes  on  the  efficiencies. 
We  generated  500  sets  of  families  and  for  each  set,  randomly  generate  families  with 
numbers  of  offspring  varying  from  1  to  7.  Therefore,  each  set  has  a  different  combi¬ 
nation  of  family  sizes.  We  calculated  the  efficiencies  for  each  combination  at  p\  =  0.4 
and  p2  =  0.8.  We  chose  p%  far  away  from  p\  since  the  effect  of  the  relative  dispersion 
of  family  sizes  is  more  obvious  under  strong  conditional  sib-sib  dependence. 

Our  investigation  of  the  MVN  model  is  more  thorough  than  the  other  two  distributions. 
We  hope  the  results  can  provide  us  with  insight  into  the  general  behavior  of  the  CL  estimates, 
not  just  limited  to  the  continuous  response. 

(2)  Binary  response:  MVP  model 

In  a  MVP  model,  it  is  assumed  that  the  observed  binary  response  vector  Y*  depends  on 
a  latent  random  vector  Zf  ~  N(ni:  Rf(a)),  where  =  Xi/3.  When  the  value  of  Ztj  exceeds 
the  fixed  threshold  0,  17?  =  I(Zij  >  0). 


ft 

A 

— 

A 

A) 

ES 

EBB 

0 

0 

0.50 

fism 

0 

0.749 

EEH 

El 

■ 

0 

1 

0.159  -  0.841 

■«» 

l 

0.371  -  0.953 

1.65 

H 

0 

2 

0.023  -  0.977 

0.67 

2 

0.092  ~  0.996 

1.65 

H 

Table  1:  Range  of  Pr(Y  =  1)  for  each  combination  of  (3q  and  (3\  (MVP  model). 

In  our  comparisons,  we  simulate  the  values  of  the  covariate  from  a  uniform  distribution 
on  (—1,1).  These  values  are  independent  both  within  a  family  and  among  families  The 
efficiency  is  compared  at  different  levels  of  /30  and  (listed  in  Table  1).  Only  positive  values 
of  (3  were  investigated.  By  symmetry,  the  results  can  be  extended  to  negative  values.  The 
efficiencies  are  calculated  for  the  following  cases: 

Case  1  Bivariate  case  (k{  =  2  for  all  families).  We  focus  on  the  impact  of  the  familial 
dependence  on  the  efficiency  of  We  generate  x?-  values  for  1000  families.  The 

ARE^  cli  is  evaluated  at  the  nine  combinations  of  /30  and  in  Table  1  with  p 
varying  in  the  range  (—1,1]. 

Case  2  Exchangeable  dependence  with  varying  family  sizes.  We  generate  the  covariate  values 
of  500  families  with  their  sizes  being  randomly  generated  from  1  to  4.  We  compute 
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the  efficiencies  of  the  CL1  and  CL2  estimates  of  all  the  parameters  for  the  first  four 
combinations  of  Po  and  P 1  in  Table  1.  The  other  combinations  are  not  included  since 
in  those  cases  the  numerical  results  are  not  reliable  when  p  exceeds  a  certain  level. 
For  each  combination,  p  varies  from  0  to  0.9. 

Case  3  One  parent  and  multiple  offspring.  We  generate  the  size  and  covariate  values  for  500 
families.  In  each  family,  the  number  of  offspring  was  randomly  generated  from  1  to 
3.  The  same  four  combinations  for  /30  and  pi  in  Case  2  are  considered.  We  first  fix 
the  parent-offspring  correlation  px  at  0.3  vary  the  sib-sib  correlation  p2  from  0  to  0.9. 
We,  then,  set  p\  at  0.6  and  vary  p2  from  0.3  to  0.9. 

(3)  Survival  Data  Subject  to  Right  Censoring:  MVN  Model 

Comparisons  of  this  case  will  help  us  to  gain  some  knowledge  on  how  the  CL1  or  CL2 
estimates  are  affected  by  the  presence  of  censoring.  We  consider  the  MVN  model  for  the 
log  survival  as  in  (1)  with  =  (30  +  PiXy,  but  with  the  response  subject  to  random  right 
censoring.  The  comparisons  are  done  by  simulations  in  which  the  likelihood  is  evaluated  by 
numerical  integration. 

In  the  simulation  study,  we  consider  an  exchangeable  dependence  structure.  Four  levels 
of  dependence  were  considered:  p  =  0.1,  0.4,  0.7  and  0.9.  We  fix  p0  =  1,  Pi  =  1  and  a2  =  4 
since  they  have  little  impact  on  the  efficiency.  In  each  simulation  we  generate  5,000  samples 
each  containing  2,000  families.  The  family  size  are  randomly  generated  from  1  to  4.  The 
values  of  the  covariate  is  generated  from  uniform  (—1, 1).  The  data  are  generated  with  three 
different  censoring  rates:  0,  0.2,  0.4  and  0.8.  The  log  censoring  random  variable  is  assumed 
to  be  N(p,c ,  cr2)  with  pc  chosen  to  achieve  the  censoring  rates. 

6.2  Results 

In  our  investigations,  we  find  that  the  efficiencies  are  mainly  affected  by  two  factors,  degree 
of  dependence  and  distribution  of  family  sizes.  Therefore,  we  summarize  our  results  by  these 
two  factors.  Under  each  factor,  we  present  our  results  by  type  of  parameter:  the  regression 
parameter  (/?i),  the  other  univariate  parameters  (/?0  and  a2)  and  the  dependence  parameters. 

Impact  of  Dependence 
(a)  Pi 

( 1)  MVN.  We  first  give  the  expression  of  ARE^  and  show  the  results  of  the  special 
cases.  We,  then,  give  the  expression  of  ARE^  and  compare  it  with  ARE^ 
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The  MLE  of  (3  is  j3MLE  =  ^Ei  X'R;  *Xi)  Ei  XiRi  ^  The  variance  of  /3MLi5  is 

Var  0ML*)  =  a2  (Ei  XfR-'Xi) _1  • 

The  CL1  estimate  of  /3  is  the  ordinary  least  squares  estimate,  i.e.,  /3CL1  =  X'Xj^  X'Yj. 
Consequently, 

Var(^CI1)  =  (E^) 

i  i  i 

By  treating  the  covariate  as  a  random  variable  with  mean  0  and  variance  1,  we  obtain 


AVar (P1MLE)  -  (7) 

i 

AVar 01>CL1)  ~  cr2/x>  (8) 


where  K  =  Ei^>  the  total  number  of  individuals.  The  relative  efficiency  of  Pitcn  is 
approximately 


ARE  a  = 

Pl,CLl 


AVar(/3x,MLE) 


K 


Pi  ,<71,1 


can  be 


AVar(/?1>ctl)  Ei  tr  Rj 1 

When  the  dependence  is  strong,  Rj  is  close  to  be  singular  and,  as  a  result,  ARE^ 
close  to  0. 

For  Case  1,  common  correlation  p  and  family  size  k,  the  expression  can  be  simplified  as 

{k  -  1  )p2 


ARE 


% 


CLl 


1  - 


1  +  (k  —  l)p 


Clearly,  the  estimate  fiycLi  loses  efficiency  when  p  increases.  As  p  approaches  1,  ARE^ 
approaches  to  0.  This  is  because  when  p  —  1,  Pi^mle  has  no  estimation  error,  whereas  P^cli 
has  estimation  error. 

Figure  1  shows  ARE^^  of  Case  2  (one  parent  and  multiple  offspring).  The  efficiency 
decreases  with  pi  and  approaches  zero  when  the  conditional  sib-sib  correlation,  p2  =  (p2  — 
Pi ) / ( 1  ~~  Pi)j  approaches  to  its  lower  and  upper  bounds. 

The  CL2  estimate  is  (3CL2  =  (Ez  x' A*Xz)  X'A*Y,,  where  A*  =  1  if  A*  =  1, 
otherwise,  A*  =  (%y/(l  —  h))  with 


Ei^(l  -  P2iji)  1  for  j=f, 
~Pijj'(l  ~  P%y)~l  for  3  +  f- 
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Figure  1:  ARE(§icj.2  and  ARE^iCii  for  Case  2  (MVN  model).  p2  is  the  sib-sib  correlation 
conditional  on  the  parent.  The  solid  lines  are  the  efficiencies  of  the  CL1  method  at  different 
levels  of  the  pi  and  the  dashed  lines  are  those  of  the  CL2  method. 


The  variance  of  /3CL2  is 

Var0CI2)  =  <r>  (J2  XiAJX,)' QP  XJAJI^AJX,)  (£  XJAJX,)"1. 

i  i  i 

By  treating  the  covariate  as  a  random  variable  with  mean  0  and  variance  1,  we  have 

AVar 0liCL2)  ~  (9) 

where  B*  =  A*RjA*. 

The  following  theorem,  with  proof  in  the  Appendix,  shows  that  the  CL2  estimate  of  Pi 
is  asymptotically  more  efficient  than  the  CL1  estimate. 

Theorem  4  For  the  case  of  one  covariate,  under  the  assumption  that  the  values  ofxn , . . . ,  x^t 
are  independent  realizations  of  a  random  variable  with  mean  0  and  variance  1, 

AVar(phCL2)  <  AVarffc'CLi), 

where  AVar(/3itCLi)  and  AVar(fi2,CL\)  are  given  in  (8)  and  (9),  respectively. 

For  the  exchangeable  case  with  >  2,  ARE^  =  {[1  +  p-p2(l  -o)](l  -6p)}_1,  where 
a  =  K~x  ki/(ki  —  1)  and  b  =  K~l  fcj/[  1  +  ( fa  —  l)p].  When  p  >  0,  since  0  <  a  <  1  and 
b  >  1  /k,  where  k  =  K/n,  ARE^  >  (1  +  p  -  p2)-\l  -  p/k J"1  >  (1  +  p  -  p2)-1  >  0.8. 

The  ARE^ici2  for  Case  2  is  plotted  in  Figure  1.  The  plots  show  that,  unlike  the  CL1 
estimator,  the  CL2  estimator  performs  very  well  when  pi  or  p2  is  large. 
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(2)  MVP.  The  results  for  the  bivariate  case  (Case  1)  are  plotted  in  Figure  2.  Similar  to 
the  MVN  model,  the  stronger  the  dependence  among  Z* ,  the  less  efficient  is  /3i,cli-  When 
Pi  =  0  and  p  =  1,  ARE/gi  =  0.  When  /?i  ^  0,  there  is  a  less  efficiency  loss. 


Figure  2:  ARE  of  PitcLi  when  family  size  k  =  2  (Case  1,  MVP  model). 


The  results  for  the  exchangeable  case  (Case  2)  are  show  in  Figure  3.  The  CL2  estimator 
of  pi  is  better  than  the  CL1  estimator,  especially  when  p  is  close  to  1  and  P\  is  close  to  0. 
For  the  case  of  one  parent  and  multiple  offspring  (Case  3),  given  pu  the  impact  of  p2  on  the 
efficiency  is  similar  to  the  exchangeable  case;  also,  the  CL1  estimate  loses  more  efficiency 
when  pi  =  0.6  than  when  pi  =  0.3. 


Figure  3:  ARE  of  Pi,cli  and  PitcL2  for  the  exchangeable  case  with  family  size  varying  from 
1  to  4  (Case  2,  MVP  model). 
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(3)  MVN  with  Right  Censoring.  The  efficiencies  from  the  simulations  are  shown  in 
Table  2.  Regardless  of  the  degree  of  censoring,  Pi,cli  loses  efficiency  as  p  increases.  How¬ 
ever,  the  loss  of  efficiency  becomes  less  when  the  censoring  rate  r  increases.  Whereas,  the 
performance  of  Pi, cm  is  satisfactory,  even  though  it  loses  some  efficiency  as  p  increases. 
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0.996 

0.995 

1.000 

0.889 

0.994 

0.988 

0.996 

0.992 

0.995 

0.996 

0.999 

0.876 

0.8 

0.998 

0.994 

0.999 

0.996 

1.000 

0.999 

1.000 

0.884 

0.4 

0.937 

0.788 

0.979 

0.957 

0.960 

0.968 

0.987 

0.967 

0.945 

0.807 

0.980 

0.946 

0.974 

0.970 

0.993 

0.964 

0.945 

0.842 

0.973 

0.936 

0.978 

0.969 

0.992 

0.928 

0.962 

0.906 

0.976 

0.943 

0.986 

0.977 

0.993 

0.903 

0.7 

mm 

0.885 

0.447 

0.924 

0.916 

0.942 

0.958 

0.961 

0.957 

0.2 

0.882 

0.480 

0.909 

0.908 

0.944 

0.949 

0.961 

0.952 

EH 

0.874 

0.547 

0.902 

0.891 

0.949 

0.945 

0.966 

0.922 

EH 

0.894 

0.677 

0.905 

0.887 

0.964 

0.939 

0.966 

0.898 

0.9 

0 

0.846 

0.160 

0.863 

0.885 

0.924 

0.966 

0.934 

0.943 

0.2 

0.841 

0.183 

0.845 

0.876 

0.926 

0.947 

0.937 

0.940 

0.5 

0.840 

0.239 

0.839 

0.869 

0.943 

0.932 

0.949 

0.925 

0.8 

0.871 

0.353 

0.867 

0.889 

0.982 

0.913 

0.976 

0.919 

Table  2:  ARE  of  the  CL1  and  CL2  for  the  exchangeable  case  with  family  size  varying  from 
1  to  4  (MVN  model  with  right  censoring). 

(b)  p0  and  cr2 

From  all  the  cases  we  investigated,  the  performance  the  CL1  and  CL2  estimates  is  satis¬ 
factory.  In  general,  the  efficiencies  decrease  with  the  degree  of  dependence,  but  the  impact 
of  the  dependence  on  these  estimates  is  fairly  small.  Since  the  parameters  p0  and  a2  are 
often  considered  as  less  important  in  familial  analysis,  we  do  not  give  the  details  of  the 
comparisons.  As  an  example,  see  Table  2  for  the  case  of  MVN  model  with  right  censoring. 


(c)  Dependence  parameters 

(1)  MVN.  In  Figure  4,  we  plot  the  efficiency  of  pCLi  and  pCL2  for  the  exchangeable  case 
with  two  family  sizes  2  and  6  of  proportion  3:1.  In  Figure  5,  we  plot  the  results  of  the  case 
of  one  parent  and  multiple  offspring  (Case  2). 

(2)  MVP.  Figure  6  shows  the  results  of  the  exchangeable  case  (Case  2).  The  results  are 
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2-1  “ UCL 

|  --  WBCL 


*“i - 1 - 1 - 1 - 1 - r* 

0.0  0.2  0.4  0.6  0.8  1.0 

P 

Figure  4:  Efficiency  of  pc'Li  and  pcL2  for  the  exchangeable  case  (Case  1,  MVN  model);  family 
sizes  are  2  or  6  with  proportion  3:1. 

similar  for  Case  3  (one  parent  and  multiple  offspring).  The  efficiencies  of  CL1  and  CL2  are 
quite  high  in  all  cases. 

(3)  MVN_  with  right  censoring  The  results  are  shown  in  Table  2.  The  efficiencies  of  both 
the  CL  estimates  slightly  decrease  when  the  dependence  become  stronger.  The  impact  of 
the  dependence  is  similar  to  the  case  when  there  is  no  censoring.  It  appears  that  the  CL2 
method  is  slightly  less  efficient  at  a  high  censoring  rate  than  at  a  low  censoring  rate. 

(d)  Conclusion 

In  summary,  is  most  affected  by  the  degree  of  dependence.  The  ARE  can  be  close 

to  0  when  the  dependence  is  strong.  For  the  binary  and  censoring  cases,  the  impact  of 
dependence  on  $itcLi  is  less  than  that  for  the  continuous  case.  On  the  other  hand,  PitcL2 
is  much  less  affected  by  the  dependence.  The  efficiency  is  considerably  improved  when  the 
dependence  is  strong. 

In  most  cases  we  investigated,  both  CL  methods  yield  reasonable  estimates  of  the  de¬ 
pendence  parameters.  The  CL2  method  is  generally  better  for  stronger  dependence,  even 
though  exceptions  can  occur;  the  CL1  method  is  better  for  weaker  dependence. 

The  Impact  of  Family  Sizes 

The  impact  of  the  distribution  of  family  sizes  is  investigated  for  the  CL1  method  and 
only  based  on  the  MVN  model. 

(a)  Exchangeable  dependence 

In  this  case,  we  have  ARE^  =  [1  +  bp2 /(I  —  p)]-1,  where 

,  _  1  kjjkj  —  1) 

~  K^l  +  {ki-l)p 
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(a)  Pi 


(b)  P2 


Figure  5:  The  efficiencies  of  the  CL  dependence  parameter  estimators  for  Case  2  (one  parent 
and  multiple  offspring;  MVN  model).  p2  is  the  sib-sib  correlation  conditional  on  the  parent. 
The  solid  lines  are  the  efficiencies  of  the  CL1  method  at  different  levels  of  the  pi  and  the 
dashed  lines  are  those  of  the  CL2  method. 


The  term 


k(k  -  1) 

1  +  (k-  1  )p 


is  a  strictly  convex  function  of  k  since 


Hk-  1)  1  =  2(1  -  p) 

dk2  [l  +  (A  —  l)p\  (1  -I-  lie  -  -  ' 


By  Jensen’s  inequality, 


It  follows  that 


b  > 


k-l 

l  +  (k-l)p 


AREa  <  1  - 

Pi, CXI  — 


(fc  -  l)p2 

1  +  (k  —  l)p' 


Equality  holds  if  and  only  if  ki  —  k  for  all  i.  This  implies  that:  one,  for  given  K  and  n, 
the  efficiency  of  ARE^  is  always  lower  when  family  sizes  vary;  two,  roughly  speaking, 
ARE^j  Cil  decreases  when  k  increases. 

For  other  parameters,  the  expressions  are  more  complicated.  But,  there  is  one  thing  in 
common  for  all  of  our  studied  cases:  the  estimates  are  most  efficient  when  the  family  sizes 
are  equal. 


(b)  One  parent  and  multiple  offspring 

In  Figure  7  and  8,  we  plot  the  efficiency  against  the  mean  of  k*  (number  of  offspring), 
k*,  and  the  variance-mean  ratio  of  k*,  V/M.  The  following  patterns  were  observed: 
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Figure  6:  ARE  of  Pclu  and  pcL2  for  exchangeable  dependence  with  varying  family  sizes 
(Case  2,  MVP  model). 


1.  ARE/3i  cli  mainly  depends  on  k*.  Figure  7  shows  that  the  efficiency  is  monotonely 
decreasing  in  k*. 


Figure  7:  ARE  of  against  average  number  of  offspring,  k*  for  one  parent  and  multiple 

offspring  with  various  combinations  of  family  sizes  (Case3,  MVN  model). 

2.  Figure  8  shows  the  efficiencies  of  the  other  four  parameters.  The  x-axis  is  k*  and  the 
y-axis  is  the  variance-mean  ratio  of  the  k*  s.  For  /30,cli,  o' cli  and  P2,cli,  the  efficiency 
is  almost  linearly  decreasing  (at  a  slow  rate)  with  both  k*  and  V/M.  On  the  other 
hand,  ARE^  CL1  is  mainly  affected  by  V/M. 

( c )  Conclusion: 

In  general,  the  efficiencies  of  the  CLI  estimates  tend  to  decrease  as  the  average  family  size 
increases  and  the  relative  dispersion  of  family  size  increases.  When  the  relative  dispersion  of 
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family  sizes  is  large,  the  data  are  mainly  formed  by  either  small  or  large  families.  Individuals 
or  pairs  contributed  by  different  families  are  equally  weighted  in  the  CL1  method.  That  is 
why  large  relative  dispersion  of  family  size  combined  with  high  dependence  has  a  strong 
effect  on  the  efficiency. 


f>0 


<J2 


Pi 


P2 


Figure  8:  ARE  of  /?o,cli>  &cl\i  Pi,cli  and  p2,CLi  for  one  parent  and  multiple  offspring  with 
various  combinations  of  family  sizes  (Case3,  MVN  model). 


7  Example 

This  research  is  motivated  by  analysis  of  data  from  patients  affected  by  an  autosomal  domi¬ 
nant  genetic  disorder  neurofibromatosis  1  (NF1)  (Friedman  and  Birch,  1997).  In  this  section, 
we  use  a  multivariate  probit  model  to  analyze  the  presence  of  subcutaneous  neurofibromas 
on  NF1  patients  and  use  the  CL  methods  to  estimate  the  parameters. 

The  data  are  from  two  large  NF1  databases:  the  National  Neurofibromatosis  Foundation 
International  Database  and  the  Neurofibromatosis  Institute  Database.  We  assume  that  the 
families  are  representative  and  behave  like  a  random  sample.  There  are  867  patients  included 
in  our  analysis.  These  patients  are  from  371  different  families.  Among  all  the  families  there 
are  12  of  size  one,  264  of  size  two,  68  with  three,  18  with  four,  5  with  five,  2  with  six  and  2 
with  seven.  The  numbers  of  affected  pairs  included  are  summarized  in  Table  3. 

Since  subcutaneous  neurofibromas  are  known  to  be  increasing  in  frequency  with  age,  age 
is  included  as  a  covariate  (on  the  log-log  scale).  We  consider  three  dependence  parameters: 
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Type  of  relative  pair 

Number  of  Pairs 

siblings 

353 

parent-offspring 

264 

second  degree 

57 

third  degree 

23 

fourth  degree 

1 

Table  3:  Number  of  affected  pairs  included  in  the  analysis  of  presence  of  subcutaneous 
neurofibromas  in  NF1  patients. 


pss  (sib-sib  correlation),  ppo  (parent-offspring  correlation)  and  p2+  (correlation  between  any 
relative  pair  higher  than  second  degree).  The  correlation  between  a  third  or  fourth  degree 
relative  pair  is  set  to  be  equal  to  the  correlation  between  a  second  degree  relative  pair  because 
there  are  only  24  pairs  of  third  or  fourth  degree.  The  parameters  are  estimated  using  the 
CL1,  CL2  and  MLE  methods.  The  maximum  family  size  is  seven  in  this  dataset,  so  we 
are  able  to  compute  the  MLE  using  the  second  order  approximation  of  Joe  (1995)  for  MVN 
rectangle  probabilities.  The  standard  errors  of  the  CL1  and  CL2  estimates  are  obtained  by 
the  jackknife  method.  The  results  are  reported  in  Table  4. 


CL1 

CL2 

MLE 

estimate 

s.e. 

estimate 

estimate 

s.e. 

Covariate  coefficients: 
intercept 

-3.109 

0.291 

-3.121 

0.288 

-3.199 

0.240 

age* 

Correlations: 

2.517 

0.254 

2.536 

0.250 

2.606 

0.207 

sib-sib  (paa) 

0.660 

0.099 

0.594 

0.121 

0.652 

0.093 

parent-offspring  (ppo) 

0.493 

0.116 

0.443 

0.109 

0.463 

0.096 

other  (p2+) 

0.710 

0.311 

0.698 

0.322 

0.670 

0.275 

age*=log(log(age+2)) 


Table  4:  Parameter  estimates  of  the  Probit  model  for  the  presence  of  subcutaneous  neurofi¬ 
bromas  in  NF1  patients. 


The  point  estimates  from  all  the  three  methods  basically  agree  with  each  other.  The 
standard  errors  of  the  CL1  and  CL2  estimates  are  close  to  each  other,  but  around  20%  larger 
than  those  of  the  MLEs.  It  is  no  surprise  that  the  estimates  of  p2+  have  large  standard  errors 
because  there  are  only  a  small  number  of  2+  relative  pairs  and  half  of  them  are  from  four 
families.  The  estimated  correlations  suggest  that  the  dependence  structure  is  not  far  from 
exchangeable. 
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8  Discussion  and  further  research 

The  two  proposed  CL  methods  can  greatly  reduce  computation  in  parameter  estimation 
when  the  ML  method  is  computationally  too  difficult  to  implement.  The  example  and  oth¬ 
ers  in  Zhao  (2004).  demonstrate  that  the  CL1  and  CL2  methods  work  well  in  practice.  Our 
investigations  in  the  asymptotic  efficiency  comparisons  showed  that  the  relative  asymptotic 
efficiencies  of  these  two  methods  are  satisfactory  except  for  some  extreme  cases.  Moreover, 
those  investigations  help  us  better  understand  under  what  situations  these  methods  work 
well  and  provide  hints  on  how  to  improve  them.  Based  on  our  investigations,  we  recommend 
the  CL2  method  when  the  initial  data  analysis  suggests  a  strong  dependence.  It  generally 
provides  better  estimation  for  all  the  parameters,  especially  the  regression  parameters.  How¬ 
ever,  it  may  be  numerically  more  difficult  to  implement  since  computation  increases  when 
the  total  number  of  parameters  increases.  We  recommend  the  CL1  method  when  the  initial 
data  analysis  suggests  a  weak  dependence.  Moreover,  when  the  CL2  method  is  utilized,  we 
can  use  the  CL1  estimate  as  the  starting  value. 

As  noted  in  Section  4,  the  estimation  with  the  CL  methods  can  be  done  with  numerical 
optimization  rather  than  solving  systems  of  non-linear  equations.  If  the  sample  size  is  not 
large,  the  numerical  optimalization  should  be  done  with  a  constraint  that  the  estimated 
dependence  parameters  lie  inside  the  parameter  space  of  the  multivariate  joint  distribution. 
Without  this  constraint,  the  solution  of  the  estimating  equations  may  be  outside  of  the  pa¬ 
rameter  space.  For  models  based  on  the  MVN  distribution,  the  constraint  on  the  dependence 
parameters  is  that  the  correlation  matrix  of  each  family  is  positive  definite. 

To  improve  the  efficiency  of  estimating  (3  in  the  CL1  method,  we  consider  adding  weights 
in  the  estimating  function  in  (1).  When  is  parametrized  so  that  vit  =  r?(Xy-; /3),  then  in 
(1),  one  could  change  the  estimating  equation  for  (3  from  ]>V  JV  t0 

^x;wig(yfl,...,yifci;A  7), 

i 

where  g(Ya, . . . ,  Yiki\ (3, 7)  is  the  vector  of  dl{Yij\ for  j  =  1, . . . ,  ku  W%  is  a  fox  kt 
weight  matrix,  X*  =  (dr/foP/dp,  ...,drj{xik)/dp)  [X*  =  (xa, . . .  ,xiki)  when  rj{x,P)  =  x'/3]. 
Based  on  the  matrix  version  of  the  Cauchy-Schwarz  inequality  (Chaganty,  1997),  we  can 
determine  the  optimal  Wj’s;  the  asymptotic  efficiency  of  the  estimates  of  /3  with  these  Wj’s 
has  been  shown  to  be  close  to  the  asymptotic  efficiency  of  the  ML  estimator  of  /3  for  special 
cases  of  the  MVN  and  MVP  models.  Future  research  could  be  done  on  weighted  versions 
of  the  estimating  equations  in  (1)  and  (3);  this  includes  an  iterative  approach  to  determine 
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good  weight  matrices. 

Finally,  we  provide  some  remarks  on  the  CL  and  related  estimating  equation  methods. 

The  CL1  and  CL2  methods  are  not  restricted  to  the  models  in  this  paper  or  to  familial 
data.  In  particular,  the  CL2  method  works  for  multivariate  models,  whether  or  not  the 
parameters  are  common  to  the  different  margins.  It  works  even  when  the  univariate  margins 
do  not  belong  to  the  same  distribution  family.  The  parameters  can  be  estimated  as  long 
as  the  bivariate  marginal  likelihoods  can  be  evaluated.  Therefore,  we  can  consider  a  more 
general  form  of  the  BCL  function: 

n 

nCLm = '£fwiY,kii'<yv,Yij'-,o), 

*=i  j>? 

where  is  the  log  likelihood  of  bivariate  margin  of  (Yy,  Yy/).  This  estimation  approach 
can  be  considered  if  the  high-dimensional  likelihood  is  too  difficult  or  impossible  to  compute. 

The  idea  can  be  extended  to  trivariate  CL.  However,  as  the  dimension  becomes  higher, 
the  computational  demands  increase,  so  the  implementation  of  such  an  approach  may  be 
more  difficult.  When  all  the  parameters  are  either  univariate  or  bivariate  parameters,  it  is 
not  known  if  the  efficiency  will  be  improved  by  using  the  likelihoods  of  marginal  distribution 
of  higher  order. 


Appendix:  Proof  of  Theorem  4 

(1)  We  first  show  that  ^tr  A*  >  K.  For  ki  =  1,  tr  A*  =  1.  For  ki  >  1, 

k-  k ■ 

tr  AT  =  (k,  -  I)"1  -  4,)“'  >  (h  -  l)-1  £>  -  1)  =  k„ 

3- 1  tyj  j— 1 

(2)  Next  we  show  that  tr  B*  <  tr  A*.  It  is  true  when  ki  =  1.  For  kt>  1,  we  have 

ki  k{ 

tr  A*  =  (ki  -  1)_1  ^  am  and  trB*  =  (ki  ~  !)-2  X  ay'R*ay> 

j=l  j  =  l 

where  a y  is  the  jth  column  of  A*.  It  suffices  to  show  a^R^a^*  <  a^ki  —  1).  For  j  =  15 


am + cLujpuj + a{  T  pijjr  —  (ki  1 )  G;1 1  • 

3=  2  3=  2  2 


Since 


•  V  %  f*2  ^  **2 

aiU  +  XI  a^jP^3  —  X  l  -  n ?  .  _  X  Y 


ki  2 


: _ —  k  _  i 

2  **1  ■> 
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a^R^i  —  [ki  ~~  l)am  +  am  aujPiij  +  £  ah  +  X)  0>i  1  j  &ilj'  Pijj ' 

3=2  j= 2  &?>  2 

ki  2  ki 


j=2  ~  Pi1! 
PUjPilj'  Pijj 


,2  \2 


+J^2  (!  -  PiyK1  - 


.2 

nu  _ Pilj _  ,  PnjpiirPijf 

m> 2  (!  -  pJyX1  -  pfyO  jJ^2  (!  “  ^yX1  -  for) 

2  2  o 

Pilj  Pilj'  —  ^  Pilj  Pilj' Pijj' 


(*.  -  ikn  -  E  ^7T 


PajX1  ~  Pa/) 


Since  p-y +  P?i/  >  2|pajPa/|  >  2|pajPa/Pii/|,  we  have  p2aj  + p2lf -2piljpilfpijj,  >  0.  Thus, 
we  proved  that  a^R^aa  <  (A:*  —  l)oai-  By  symmetry,  the  inequality  holds  for  all  j. 

From  (1)  and  (2)  we  can  easily  obtain  that  AVar(/3liCi,2)  <  AVar(/31)Cil).  The  equality 
holds  when  R*  =  I  for  alH.  □ 
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Summary.  In  familial  data  analyses,  quantifying  familial  association  is  sci¬ 
entifically  important.  As  analogies  of  the  intraclass  and  interclass  correlations 
of  a  normally  distributed  trait,  we  study  intraclass  and  interclass  (log)  odds 
ratios  for  a  binary  trait.  We  propose  non-parametric  estimators  of  the  odds 
ratios  under  the  assumptions  of  exchangeability  and  closure  of  multivariate 
binary  distributions  under  margins.  These  estimators  are  straightforward, 
except  for  the  consideration  of  how  to  weight  by  family  size.  The  main  re¬ 
sults  are  the  derivations  of  the  asymptotic  variances  of  the  estimators.  The 
relative  efficiencies  of  the  non-parametric  estimators  are  studied  for  some 
parametric  models.  It  shows  that  our  estimators  are  highly  efficient,  and 
that  weighting  by  family  size  is  recommended  for  the  intraclass  odds  ratio. 
The  computations  of  the  estimators  and  their  standard  errors  are  illustrated 
with  two  examples. 
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1.  Introduction 

For  continuous  familial  data  for  which  a  multivariate  normal  distribution  can 
be  assumed  for  each  family,  the  interclass  and  intraclass  correlations  are  used 
to  describe  the  familial  associations.  The  intraclass  correlation  is  of  interest 
when  a  common  correlation  is  assumed  for  each  pair  of  observations  in  the 
same  family.  The  interclass  correlation  is  of  interest  when  there  are  two 
classes  in  each  family  and  exchangeability  is  assumed  for  each  class  within 
a  family.  The  interclass  correlation  is  the  correlation  between  a  pair  with 
one  member  from  each  class.  See  for  example  Donner  and  Koval  (1988)  and 
Srivastava  and  Katapa  (1986). 

This  paper  is  concerned  with  binary  familial  data.  We  study  the  intraclass 
and  interclass  (log)  odds  ratios  as  the  analogies  of  the  intraclass  and  inter¬ 
class  correlations.  This  research  is  motivated  by  an  study  of  clinical  features 
on  patients  affected  by  neurofibromatosis  1  (NF1),  an  autosomal  dominant 
genetic  disorder  (Szudek  et  al.,  2000)  with  highly  variable  expressivity.  The 
objective  is  to  quantify  the  association  between  the  occurrence  of  the  same 
clinical  feature  in  relatives  affected  with  NF1.  Comparison  of  the  degree  of 
association  between  different  types  of  relatives  will  help  us  to  identify  the 
(genetic  and  non-genetic)  sources  of  the  variation. 

We  are  specially  interested  in  two  types  of  associations:  interclass  as¬ 
sociation,  such  as  association  between  parent  and  offspring;  and  intraclass 
association,  such  as  association  between  siblings.  The  following  is  an  exam¬ 
ple  of  interclass  odds  ratio.  Let  X  and  Y  be  the  indicators  of  the  occurrence 
of  a  feature  for  a  parent  and  his/her  offspring,  respectively.  If  the  feature 
is  present,  we  say  that  the  individual  is  affected.  In  this  case  the  indicator 
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equals  1.  Then  there  are  four  possible  outcomes  of  (X, Y)\_  (1,  1),  (1,  0), 
(0,  1),  and  (0,  0).  Suppose  the  probability  vector  associated  with  the  four 
outcomes  are  P  =  (Pi,  P2,  P3,  P4)'.  The  parent-offspring  odds  ratio,  denoted 
by  ri2  is  [PiPi]/[P2P3].  The  intraclass  odds  ratio,  denoted  by  r,  is  defined  in 
a  similar  way. 

When  we  have  n  independent  and  identically  distributed  (i.i.d.)  pairs  of 
indicators  (Xi,Yi),  i  —  1, . . .  ,n,  we  can  form  a  2  x  2  contingency  table  as 
illustrated  in  Table  1.  The  vector  of  counts  (Oj,  02,  03,  O4)  is  a  random 
sample  from  Multinomial  (n;  P).  The  odds  ratio  is  estimated  by 


ri2 


O1O4 

O2O3 


(1) 


The  log  odds  ratio  j12  is  estimated  by  y12  =  log(fi2)  with  asymptotic  stan¬ 
dard  error  (s.e.)  n~x  1  /Pi  (based  on  the  i.i.d.  assumption). 


[Table  1  about  here.] 

For  familial  data  with  a  varying  number  of  members  per  family,  a  sim¬ 
ple  way  to  estimate  the  odds  ratio  is  to  from  a  contingency  table  with  all 
possible  relative  pairs.  For  example,  form  a  contingency  table  with  all  the 
parent-offspring  pairs  to  estimate  the  parent-offspring  odds  ratio.  In  this 
case,  the  pairs  from  a  family  containing  more  than  two  members  are  not 
independent.  Under  the  assumptions  of  exchangeability  and  closure  under 
margins,  the  cross-product  ratio  of  the  table  is  still  a  consistent  estimate  of 
the  odds  ratio.  However,  the  dependence  among  pairs  will  affect  the  variance 
of  the  estimator.  Therefore,  we  need  to  adjust  the  asymptotic  variance  of 
the  estimator  accordingly. 
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The  idea  of  forming  a  2  x  2  contingency  table  by  all  possible  relative  pairs 
has  been  used  by  Hunt  et  al.  (1988)  to  test  familial  aggregation  for  families 
of  a  fixed  size.  The  test  is  based  on  the  standard  y2  statistic  generated 
from  the  contingency  table.  They  showed  that  the  standard  y2  statistic 
is  appropriate  for  intraclass  aggregation  despite  the  dependence  among  the 
pairs.  For  interclass  aggregation,  an  adjustment  based  on  the  affected  rate 
and  the  intraclass  aggregation  is  needed.  Some  of  moment  calculations  in 
this  paper  are  borrowed  from  Hunt  et  al.  (1988). 

The  rest  of  this  paper  is  organized  as  follows.  In  Section  2,  we  derive 
the  asymptotic  variance  of  the  estimator  of  the  interclass  log  odds  ratio. 
We  propose  estimation  of  the  asymptotic  variance  and  do  an  efficiency  com¬ 
parison  of  the  proposed  non-parametric  estimator  with  the  MLE  generated 
from  parametric  models.  In  Section  3,  we  study  estimation  of  the  intraclass 
log  odds  ratio.  Since  the  simple  estimator  tends  to  emphasize  large  fami¬ 
lies  disproportionately,  weights  are  considered  to  improve  the  efficiency.  We 
compare  the  efficiencies  of  the  unweighted  and  weighted  estimators  with  the 
MLE  based  on  parametric  models.  Two  applications  of  these  methods  are 
given  in  Sections  2.5  and  3.6. 

2.  Interclass  Odds  Ratio 

We  are  interested  in  estimating  an  interclass  odds  ratio  when  we  have  mul¬ 
tivariate  binary  data  of  the  following  form.  For  family  i  (1  <  i  <  n),  there 
are  kn  members  in  class  1  and  ki2  members  in  class  2;  for  our  applications 
we  will  think  of  class  1  as  parents  (with  kn  =  1  or  2)  and  class  2  as  off¬ 
spring.  The  binary  values  for  family  i  are  X;  =  (Xn,. . . ,  Xikil)  for  class  1 
and  Y*  =  (lii, . . . ,  Yiki2)  for  class  2. 
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Within  a  family,  exchangeability  is  assumed  in  each  class  and  closure 
under  margins  is  assumed  for  the  joint  probability  distribution.  That  is, 
there  is  a  probability  distribution  p(x,y;  k\,  k2)  such  that 

Pr(Xj  =  x,  Yi  =  y)  =  Pr(X*  =  x*,  Y*  =  y*)  =  p(x,y;  kihki2) 

where  x*  and  y*  are  arbitrary  permutations  of  x  and  y,  respectively.  Under 

this  assumption,  there  is  a  common  interclass  odds  ratio  across  families: 

_p(l,l;l,l)p(0,0;l,l)  _P1P4 
rn  p(l,  0;  1,  l)p(0, 1;  1, 1)  P2P3' 

2.1  Estimator  of  log  odds  ratio  712 

For  each  family,  we  pair  each  member  of  class  1  with  each  member  of 
class  2,  and,  then,  use  pairs  from  all  the  families  to  form  a  contingency  table 
as  in  Table  1.  Since  the  probability  that  a  pair  falls  in  cell  l,  l  =  1, . . . ,  4, 
is  Pi  for  all  the  pairs,  E  (Oj)  =  NPt,  where  N  is  the  total  number  of  cross¬ 
class  pairs.  Therefore,  Pi  =  Oi/N  is  an  unbiased  estimate  of  P*.  As  a 
result,  ri2  =  [0i0i\/[020^  is  an  asymptotically  consistent  estimator  of  the 
interclass  odds  ratio.  We  estimate  the  log  odds  ratio  is  by  712  =  logfi2- 

2.2  Asymptotic  variance  0/712 

We  first  derive  some  general  results,  and  then  state  them  for  the  special 
case  of  kn  =  1  for  all  i. 

General 

Since  712  is  a  function  of  O  =  (Oi,  02, 03, 04/,  we  first  obtain  Var(O), 
and  then  use  the  Delta  method  to  obtain  the  asymptotic  variance  of  712 . 
Clearly  the  distribution  of  O  is  no  longer  multinomial  since  the  pairs  con¬ 
tributed  by  the  same  family  are  not  independent  due  to  the  intraclass  de¬ 
pendence  and  repeated  use  of  the  same  individual  in  multiple  pairs. 


5 


Let  O i  =  (On,  l  =  1,  be  the  counts  in  the  contingency  table  con¬ 
tributed  by  family  i.  Let  iff  be  the  indicator  that  the  jth  pair  contributed 
by  family  i  falls  in  cell  l.  Then  On  =  Yffi  1  iff  ■  The  variance  of  On  is 

Var  (On)  =  £ Var  (jg>)  +  Cov  (iff,  iff)  =  JV,P,(1  -  P,)  +  b,(Pff  -  If), 
i=1  iVi 

where  N%  =  kak^,  the  number  of  cross-class  pairs  formed  by  family  i,  b,  = 
Ni(Ni  —  1)  and  Pff  is  the  probability  that  pairs  j  and  f  both  fall  in  cell  l. 
The  covariance  between  On  and  Ojm,  l  =/--  m,  is 

Cov(0a,  Oim)  =  cov(f;/®,f;/w)  =  f;f;cov(/®,/W) 

3=1  j'=l  j= 1  j'= 1 

=  £Cov(lff,lff‘>)  +  £Cov(lff,lff>) 

3=1  j'ft 

=  -NiPiPn  +  -  PtPm), 


where  is  the  probability  that  pairs  j  and  j'  fall  in  cells  l,  m  respectively. 
Since  the  two  pairs  are  symmetric,  pff  =  Pff . 

Next  we  derive  the  Pff  s.  If  we  randomly  choose  two  distinct  pairs  from 
all  the  pairs  within  family  i,  there  are  three  different  types  of  combinations: 


type  I:  two  pairs  containing  the  same  member  from  class  1  and  different 
members  from  class  2.  This  occurs  with  probability 
(i)  k{\  ki2  (k)2  1)  k{ 2  1. 


OjT 


Ni(Ni  -  1) 


Ni  - 1  ’ 


type  II:  two  pairs  containing  the  same  member  from  class  2  and  different 


(d 


members  from  class  1.  This  occurs  with  probability  arf} 


k%  i  1 

Nt-  1' 


type  III:  two  pairs  containing  no  common  members.  This  occurs  with  prob¬ 
ability  af]j  =  1  —  aff  -  afj . 
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When  ka  =  1  (and  iVj  =  k^),  ag  =  1,  ag  =  afjjj  =  0;  when  fcj2  =  2, 
af  =  a%  =  (ka  -  l)/(2 fci2  -  1),  og  =  l/(2fca  -  1). 

Let  P;m|d  be  the  probability  that  two  distinct  pairs  fall  in  cells  l  and  m 
given  that  the  pair  set  is  of  type  d,  d  =  I,  II  or  III.  Then  P/g  =  E<*  a<dPim\d- 
The  variance  and  covariance  of  O  can  be  easily  obtained  based  on  the 
previous  results.  Since  Oi  =  Ej  On,  we  have 


Var  (Oi)  =  jVPi(l-P1)  +  5S,(if-/f)  (2) 

i=l 

n 

Cov (0(, Om)  =  -NPfn  +  Y^HPil-PiPm),  for  l^m.  (3) 

i=  1 

The  asymptotic  variance  of  7,  a1  given  by  the  Delta  method  is 


dP' 


dP 


J_  djn  ,n^7i2 
at2  ap' c  ^  ap  ’ 


where  —75-,  77)  •  Algebraic  simplification  leads  to  the 

atr  \Pi  Pj  P3  P4/ 


following  result. 


Result  1.  The  asymptotic  variance  0/712  is 

a2  =  jfCTo  +  jfY,BdVd^  (4) 

d 

where  el  =  E?=  1  pfl>  Bd  =  Ei  haf  /N  and 

2Pi4|d  2P23|<i  2Pi2|d  2P13|d  2P 24|,j  2P34|d 

P1P4  P2P3  PiP2  P1P3  P2P4  ~  P3P4  ’  [  ) 

d  =  I,  II  and  III. 

The  term  e%/N  in  (4)  is  the  asymptotic  variance  of  712  based  on  N 
independent  pairs. 


Vd 


EPll\d 

p2 

.1=1  1  . 
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The  magnitude  of  vi  depends  on  the  degree  of  conditional  association 
between  the  class  2  members  given  the  status  of  one  of  the  class  1  member. 
Let  X  be  the  indicator  of  the  class  1  member  and  Yi,  Y2  be  those  of  the  two 
class  2  members.  As  shown  in  Appendix  A,  vi  can  be  written  as 


where  c\  is  the  conditional  correlation  of  Y\  and  Y2  given  X  =  1,  and  C4  is 
the  conditional  correlation  given  X  =  0.  As  a  result, 

{<0  if  the  class  2  individuals  are  conditionally  negatively  dependent, 

=  0  if  the  class  2  individuals  are  conditionally  independent, 

>0  if  the  class  2  individuals  are  conditionally  positively  dependent. 

Likewise, 


where  dx  and  c'A  are  the  conditional  correlations  of  two  class  1  members  given 
the  value  of  one  class  2  member.  However,  there  is  no  such  an  intuitive 
interpretation  for  vm. 

Special  case:  kn  =  1  for  all  i 

When  kn  =  1  for  all  i,  afj  =  ctfn  =  0.  Then  we  have  the  following  result: 
RESULT  2.  When  kn  =  1  for  all  i,  the  asymptotic  variance  0/712  is 

cT2  =  (a20  +  BlVl)/N , 

with  N  =  Zi= 1  ki2  and  Bj  =  £?=: ;1  ki2(kl2  -  1  )/N. 

The  term  Bi  is  a  function  of  the  sample  mean  (M)  and  variance  (V  = 
Yjihi  -  M)2/n )  of  the  ki2s.  In  fact,  5/  =  V/M  +  (M  - 1).  When  the  family 
size  is  a  constant  k,  Bi  =  k  —  1  and  a2  =  (a2  +  (k  —  l)vj)/N. 
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In  general,  the  variance  of  712  increases  when  either  cx  or  C4  becomes 
larger.  For  the  case  when  the  class  2  members  are  independent  conditional  on 
the  status  of  the  class  1  member,  cx  =  C4  =  0,  and  vj  =  0.  Hence  a1  =  a\ /N, 
the  same  variance  as  having  N  independent  pairs.  For  the  case  when  the  class 
2  members  are  perfectly  dependent  conditional  on  the  status  of  the  class  1 
member,  ci  =  C4  =  1,  vj  =  Oq  and  cr2  =  (Bj  +  l)<r|/iV.  So  when  perfect 
dependence  occurs,  if  the  family  size  is  a  constant,  a1  —  ko^/N  =  °o/n- 
As  would  be  expected,  this  is  the  variance  achieved  considering  only  one 
class  2  member  from  each  family.  If  the  family  size  is  not  a  constant,  then 
{Bj  + 1  )/N  >  1/n,  that  is,  the  estimate  is  less  efficient  than  considering  only 
one  class  2  member  in  each  family. 

2.3  Methods  to  estimate  the  asymptotic  variance  0/712  in  (4) 

To  estimate  the  asymptotic  variances  of  712,  we  need  to  estimate  Pim\d 
for  d  =  I,  II,  III.  The  empirical  frequencies  of  different  pair  types  can  be 
used.  However,  if  there  is  limited  number  of  families  containing  three  or  more 
members,  such  estimates  of  the  probabilities  are  not  reliable.  In  this  case, 
we  can  use  the  jackknife  method  to  estimate  the  variance  of  712. 

An  alternative  approach  with  ka  =  1  for  all  i  is  the  following.  In  this  case, 
we  only  need  to  estimate  17,  which  involve  the  conditional  correlations  cx  and 
C4  in  (A.2).  To  estimate  ci,  we  use  all  the  families  whose  corresponding  class 
1  member  has  value  1.  We  denote  the  set  of  such  families  by  I\.  Suppose  7X 
contains  nx  families,  and,  in  total,  ATX  class  2  members.  We  form  an  AN OVA 
table  with  the  Y  values  of  these  class  2  members.  Let  %  —  J2j  Yij/h  for 
i  E  h  and  Y\  —  Yhieh  ^7  The  sum  of  squares  within  and  between 
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families  are: 

SSE  =  and  SSB  =  I]  **(£  -  *i)- 

ie/i  j  i€h 

The  corresponding  mean  squares  are  MSE  —  SSE/(Ni  —  ni)  and  MSB  = 
SSB/(ni  —  1).  From  the  expected  sum  of  squares  (Searle  et  ah,  1992), 

){MSB-MSE) 
aB~  Ei€/,  Mi  -  WiVi) 

is  an  unbiased  estimate  of  Cov  ( ) ,  for  i  G  I\,  and  d^  +  MSE  is  an 
unbiased  estimate  of  Var  (Y^)  for  i  €  I\.  Then  an  estimate  of  C\  is  given  by 
Ci  =  d| / (d|  +  MSE),  with  d#  set  to  be  0  if  it  is  negative.  In  a  similar  way, 

C\  can  be  estimated  based  the  AN OVA  table  of  the  Y  values  of  all  class  2 
members  from  families  with  x%\  =  0. 

2.4  Asymptotic  relative  efficiency  of  J12 

Suppose  that  the  joint  distribution  of  the  binary  variables  can  be  de¬ 
scribed  by  a  parametric  model  p(x,y;  k\,  fo;  0),  which  is  exchangeable  in  x 
and  in  y,  and  is  closed  under  margins.  Then 

712(0)  =  logp(l,  1;  1, 1;  0)+logp(0, 0;  1, 1;  0)-logp(l,  0;  1, 1;  0)-logp(0, 1;  1, 1;  0). 

An  estimate  of  712  can  be  obtained  based  on  the  MLE  of  0,  0.  This  esti¬ 
mate,  denoted  by  712(0),  is  more  efficient  than  the  proposed  non-parametric 
estimate.  In  this  subsection,  we  investigate  the  asymptotic  efficiency  loss. 

In  order  to  compare  the  efficiency,  we  need  to  derive  the  asymptotic 
variance  of  712(0)-  Let  Vh  be  the  asymptotic  covariance  matrix  of  n1//2(0— 0), 
which  is  the  inverse  of  Fisher  information  X.  If  the  proportion  of  families  with 
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sizes  ki,  k2  is  (asymptotically)  qklM  for  fei  =  1, . . . ,  mi,  k2  =  1, . . . , m2,  then 


mi  m2 

fci=l  fc2=l  x€{0}l}fci  y€{0,l}fc2 


5p(x,  y;  jfei,  k2;  0)3p(x,y;  ku  k2;  0) 

86 _ 86' 

pfc  y;h,k2;0) 


The  asymptotic  variance  of  n1/2  [712(0)  -  712]  is 

O0  “  06 

We  calculate  the  relative  efficiency,  i.e.,  the  the  asymptotic  variance  ratio 
between  712(0)  and  712,  for  specific  parametric  models.  Our  conclusions  are: 


(a)  The  relative  efficiency  is  typically  above  0.9. 

(b)  The  relative  efficiency  decreases  as  the  dependence  becomes  strong. 

(c)  The  relative  efficiency  tends  to  decrease  as  the  variance-mean  ratio  of 
family  size  increases. 


In  this  paper,  we  illustrate  these  patterns  using  the  multivariate  binary 
beta-binomial  model  (Section  7.1  of  Joe,  1997).  we  set  kn  =  1  and  ki2 
varying  from  1  to  4  with  equal  frequency.  (If  we  change  the  distribution 
of  family  sizes,  the  efficiency  exhibits  the  same  general  patterns  but  with 
different  magnitudes.) 

The  multivariate  binary  beta-binomial  model  is  specified  as  the  following: 
The  probability  of  the  presence  of  the  binary  trait  among  class  1  members  is 
denoted  by  7r;  given  the  state  of  the  class  1  member,  Xt  =  /,  J=0  or  1,  (the 
subscript  ji  is  omitted  since  kn  =  1),  the  joint  conditional  distribution  of  Y* 
follows  a  multivariate  binary  beta-binomial  model  with  parameters  (a/,/?/). 
Thus,  the  joint  conditional  probability  of  Yj  is 
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where  y  6  {0,  l}fc<2,  y+  is  the  sum  of  the  elements  in  y  and  B(  •  ,  ■  )  is  the 
Beta  function. 

To  illustrate  (a)  and  (b),  we  use  a  simple  case  with  a.\  —  Po  =  a,ao  — 
Pi  =  Pi  that  is,  ci  =  C4  =  l/(a  +  p  +  1).  In  this  case,  the  efficiency  is  not 
affected  by  the  value  of  7r  by  the  symmetry  of  the  conditional  distribution, 
and,  so,  we  set  7r  =  0.5.  The  efficiency  of  712  is  plotted  in  Figure  1.  When 
both  a  and  p  are  small,  e.g.  the  class  2  members  are  strongly  correlated,  the 
efficiency  is  relatively  low.  But  even  when  the  correlation  is  almost  1,  it  is 
still  above  0.85. 

[Figure  1  about  here.] 

Similar  results  can  be  seen  with  our  calculations  based  on  the  multivariate 
probit  model  (Zhao,  2004). 

Pattern  (c)  is  shown  in  Figure  2,  in  which  each  point  corresponds  to 
the  efficiency  associated  with  a  particular  distribution  of  family  size.  In  all 
cases,  the  number  of  class  2  members  in  a  family  is  set  to  be  1  to  4  but  with 
a  randomly  generated  frequency.  So,  the  variance-mean  ratio  varies  from 
case  to  case.  The  efficiencies  are  calculated  with  all  the  parameters  of  the 
beta-binomial  model  kept  constant  («i  =  2,  Pi  —  1,  cto  =  1,  Po  =  2,  tt  =  0.5). 

[Figure  2  about  here.] 

2.5  An  example 

In  this  subsection,  we  estimate  the  parent-offspring  odds  ratio  for  the 
presence  of  intertriginous  freckling  in  NF1  patients.  There  are  261  families 
containing  one  affected  parent  and  his/her  affected  offspring.  Among  these 
families  there  are  187  with  one  child,  52  with  two  children,  22  with  three  to 
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five.  In  total,  there  are  364  parent-offspring  pairs.  The  prevalence  of  inter- 
triginous  freckling  is  0.88  among  the  parents  and  0.80  among  the  offspring. 
The  estimates  of  Pi  to  P4  from  these  pairs  are 

A  =  0.714  (both  affected),  A  =  0.165  (only  the  parent  affected), 

A  =  0.030  (both  unaffected),  A  =  0.091  (only  the  offspring  affected). 

To  obtain  the  standard  error  of  the  log  odds  ratio  given  in  (4),  we  use 
the  ANOVA  approach  presented  in  Section  2.3  to  estimate  the  conditional 
correlations  c\  and  c4  (defined  in  (A.2)).  The  results  are:  c4  =  0.004  and 
c4  =  0.200. 

The  jackknife  method,  in  which  one  family  was  removed  at  a  time,  is  also 
used  to  obtain  the  standard  error.  Both  estimates  are  given  in  Table  2. 

[Table  2  about  here.] 

The  naive  standard  error  in  the  table  is  calculated  ignoring  the  dependence 
among  the  pairs,  which  is  not  very  different  from  the  other  two  standard 
errors,  since  the  conditional  associations  among  siblings  are  weak.  The  stan¬ 
dard  error  based  on  (4)  is  slightly  larger  than  the  jackknife  standard  error. 

3.  Intraclass  Odds  Ratio 

Consider  a  random  sample  of  n  families,  each  containing  a  varying  number 
of  members;  in  applications,  typically  the  members  are  siblings.  Let  fcj  be 
the  number  of  members  in  family  i  and  Y*  =  (yil; . . . ,  Y^)  be  the  vector  for 
a  binary  trait.  We  assume  exchangeability  and  closure  under  margins,  that 
is,  there  is  a  distribution  p( y;  k )  such  that  for  y  of  length  fc,;, 


Pr(Yi  =  y)  =  Pr(Yi  =  y*)=P(y1ij) 
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for  y*  being  any  arbitrary  permutation  of  y.  From  the  assumption  of  closure 
under  margins,  the  intraclass  odds  ratio  is 

p(l,l;2)p(0,0;2)_P1P4_  4J\P4 

p(1»  0;  2)p(0, 1;  2)  P2P3  (I-P-P4)2' 

where  P 1  =  p(l,  1;  2),  P4  =  p(0, 0;  2),  P2  =  P3  =  1  -  Px  -  P4  =  p(l,  0;  2). 

3.1  Estimator  of  log  odds  ratio  7 

For  family  i,  there  are  (**)  possible  pairs.  The  total  number  of  pairs  is 
N  =  J2i= 1  (20  ■  Each  pair  may  fall  into  three  categories:  both  equal  to  1, 
one  equal  to  1  and  the  other  equal  to  0,  and  both  equal  to  0.  Let  0\  be  the 
count  of  pairs  both  equal  to  1  and  04  be  the  count  of  those  both  equal  to  0. 
Then  P*  =  Oi/N  is  an  unbiased  estimate  of  Pt,  for  l  =  1,4.  We  estimate  r  in 


(8)  by 


r  = 


40i04 


(N  —  Oi  —  04)2 

The  corresponding  log  odds  ratio  estimate  is 


(9) 


7  =  log  Oi  -  2  log(lV  -  Ox  -  O4)  +  log  04  +  log  4.  (10) 

3.2  Asymptotic  variance  ofj 

With  similar  derivations  to  Section  2,  we  have 

Var (O,)  =  iW*(l -«)+£>(*#> -if),  for  i  =  1,4,  (11) 

i—  1 
n 

Co v(01,04)  =  -NPtPt  +  ^WPu-PiP*),  (12) 

i=l 

with  bi  =  (2)  ((2)  - 1)  an(l  Phi  =  (l-Oi)Pim|4+OiPjm|3)  l,m  =  1, 4,  where  a{ 
is  the  probability  that  two  random  pairs  from  the  ( **)  possible  pairs  contain 
only  three  distinct  members  and  P y|3  is  the  probability  that  two  such  pairs 
fall  in  the  cell  l  and  m  respectively,  while  Py|4  is  the  same  probability  for  two 
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random  pairs  containing  four  members.  Since  Pi4|3  =  0,  P$  =  (1  —  ajPu^. 
It  was  shown  by  Hunt  et  al.  (1988) 

.  hjkj  -  l)(kt  -  2)/2  4 

ft) (ft)- i)/a  +  K 

With  substitution  for  pf£,  (11)  and  (12)  become 

n 

Var  (Oi)  =  Nm-*)  +  Y,fc*F^  +  bi(l-ai)Puv-biP?), 

i= 1 
n 

Cov  (Oi,  04)  =  —NP1P4  +  £  (Ml  -  Oi)Pi4|4  -  kPlft) . 


i~  1 


Let 


b,  =  £W«8.=E«1-  di)/N  and  P3  =  =  Pi+P2;  (13) 


v,_  ,  pi2  pip4 

Vs_  'P1P4  p42 


V0  =  ( P“|4  P14|4\ 

\Pl4|4  P44|4 )  ' 

The  covariance  matrix  of  (Oi,  04),  denoted  by  V,  can  be  written  as 

3 

V  =  iW0  +  iV^PdVd. 


d=  1 


Let,#,=  F(9A’5n)’-l>ere^  =  -  +  -  and  — 
The  asymptotic  variance  of  7  given  by  the  Delta  method  is: 

3 


<x2  =  N<t>'\Q4>  +  nY,  Bd4>'Vd<t>- 


d= 1 


~P4  +  ~Pi' 
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If  we  write 


v3  =  N2<f>'V3(t>  = 


then  we  have  the  following  result. 

Result  3.  The  asymptotic  variance  of  the  intraclass  log  odds  ratio  estimator 
given  in  (10)  is 

1  1  3 

0-2  =  jv°o  +  Bdvd,  (15) 

d= 1 

with  Bd  defined  in  (13)  and  vd  defined  in  (If). 

When  each  family  contains  the  same  number  of  members,  say  k ,  then 

^  =  nk(k  -  1)  +  ~2~  t4ui  +  (k  ~  3)y2  +  (k  +  l)v3]  | . 

For  the  special  case  where  the  Yq  s  are  mutually  independent  for  each  i 
(with  ki  varying),  let  p  =  Pr(Y0-  =  1)  and  q  =  1-p,  we  have  Pi  =  p2,  P4  =  g2, 
Pi  =pq,  Pu\4  =  p2q2  and  Pu\l  =  pL,  P44|£,  =  qL,  for  L  =  3, 4.  Consequently, 
Vi  =  v2  =  -Vs  =  p~2q- 2  and  £d=i  Bdvd  =  p“V2[£i  +  P2  -  S3]  =  0. 
Hence  a2  =  cr2/7V,  which  is  the  same  variance  when  the  sample  contains  N 
independent  pairs.  Surprisingly,  although  the  pairs  are  not  independent  due 
to  the  inclusion  of  the  same  individuals  in  multiple  pairs,  the  asymptotic 
variance  of  7  is  not  inflated. 

Generally,  when  family  members  are  positively  dependent,  a2  >  <Tq. 


3.3  Weighted  estimator  of  7 

In  the  contingency  table,  a  large  family  contributes  many  more  pairs  than 
a  small  family.  Since  the  number  of  pairs  is  disproportional  to  the  family  size, 
the  large  families  might  be  over-emphasized.  With  this  concern,  we  consider 
assigning  weights  to  the  families. 

Let  rik  be  the  number  of  families  of  size  k  and  =  (Ojfc\  O^)  be  the 
counts  from  these  families.  Consider  a  weight  Wk  for  Then  the  weighted 
total  counts  are  Ow  =  We  estimate  the  log  odds  ratio  by 


7 w  =  log 


(16) 


(N1  -Owl-OwAf' 

where  N'  =  ]Cfc  (*) .  Next  we  derive  its  asymptotic  variance. 

Let  =  Var  {0^)/[nk (2)]-  Then  the  covariance  matrix  of  Ow  is 

V„=5>?n*Qv<». 

Let  4>w  =  (l/Pi  +  1/P2,1/P4  +  I/P2)  •  If  we  write  rfc  =  (N1)2 <f> 
the  asymptotic  variance  of  is 


= = ^e^(£k 


Similar  to  Result  3,  it  can  be  shown  that  t*,  =  a\  T  X)d=i  B^vd,  where  the 
B^s  are  given  by  (13).  In  particular,  r2  =  Uq  since  =  0,  for  all  d. 
Without  loss  of  generality,  we  set  ty2  =  1.  Let  N'  =  n2  +  X)fc>3  'u,/onfc(2)- 
Then, 

al  =  (7^)2  (n^  +  E  wlUk  (2)  Tfc)  • 
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To  find  the  optimal  weights,  we  differentiate  cr^  with  respect  to  the  wks 
and  set  the  derivatives  equal  to  0  and,  then,  solve  the  resulting  equations: 

wkrk  —  N'a -  0,  for  k  >  3, 

Result  4.  The  optimal  weights  which  minimize  the  asymptotic  variance  of 
Tu,  in  (16)  are  wk  =  Tijrk.  The  resulting  asymptotic  variance  is  cr^opt  = 


The  derivation  of  a?„  x  is  as  follows: 


by  the  definition  of  N'.  The  weight  wk  depend  on  Pi,  P4  and  Pim\L,  for 
l,  m  =  1,  4  and  1,  =  3,  4.  In  practice,  these  probabilities  are  unknown  and 
need  to  be  estimated.  Therefore  jw  has  to  be  obtained  iteratively. 

Instead  of  estimating  the  optimal  weights,  a  simple  approach  to  down¬ 
weight  the  large  families  is  to  set  wk  =  2 /(k  —  1)  so  that  wknk  (£)  is  propor¬ 
tional  to  k. 

Result  5.  When  set  wk  =  2 /(k  —  1),  the  asymptotic  variance  of  is 
Simple  =  J~l{ 2ao+(k-2)  [4u!  +  (k  -  3)v2  +  (k  +  l)u3]}.  (17) 

with  N'  =  Y,knkk- 

3.4  Methods  to  estimate  the  asymptotic  variance  of  7 

To  estimate  the  asymptotic  variance  of  7,  we  need  to  estimate  J^i3  and 
Pij\A-  A  simple  way  to  obtain  the  estimates  is  to  use  the  sample  proportions 
based  on  all  possible  sets  of  four  members  to  estimate  the  P^  and  use  all 
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the  possible  sets  of  three  members  to  estimate  the  P^.  However,  if  there  are 
limited  number  of  families  containing  three  or  more  members,  such  estimates 
of  the  probabilities  are  not  reliable.  In  this  case,  the  jackknife  method  can 
be  used  to  estimate  the  variance  of  7. 

3.5  Efficiency  comparison 

We  investigate  the  efficiency  of  the  weighted  and  unweighted  estimators 
by  comparing  them  with  the  MLE  based  on  a  parametric  model.  Consider  a 
parametric  exchangeable  model  p(y;  k,  0)  which  is  closed  under  margins  and 
defined  for  all  k  >2.  Then,  the  bivariate  margin  of  Yi}  Yj  is  p(yi,  yf,  2, 0 )  for 
all  i  j.  The  log  odds  ratio  based  on  the  parametric  model  is 


7(0)  =  logp(l,  1;  2, 6»)  +  logp(0, 0;  2, 0)  -  logp(l,  0;  2, 0)  -  log p(0, 1;  2, 0). 


Let  0  be  the  MLE  and  Vq  be  the  asymptotic  covariance  matrix  of  n1P(0  —  0), 
i.e.,  the  inverse  of  Fisher’s  information  J.  If  the  proportion  of  families  with 
size  k  is  (asymptotically)  qk  for  k  =  2, . . . ,  kmaX)  then 

kmax  dp(y;k,0)dp{y;k,0)  /  ,  , 

2^  — oe - QQ' — /p(y;M)- 

*=2  ye{o,i}'= 

The  asymptotic  variance  of  n1/2 [7(0)  —  7]  is 

Based  on  calculations  for  specific  parametric  models,  our  conclusions  are 
the  following. 


(a)  The  relative  efficiency  with  the  optimal  weights  is  close  to  1  (>  0.9). 

(b)  The  relative  efficiency  of  the  unweighted  estimator  decreases  when  the 
dependence  increases  and  is  not  much  affected  by  p;  the  relative  effi¬ 
ciency  can  get  below  0.6. 
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(c)  The  relative  efficiency  for  the  estimator  with  simple  weights  is  lower 
when  the  dependence  is  weak,  but  it  is  typically  above  0.8. 

We  illustrate  these  patterns  using  a  multivariate  binary  beta-binomial 
model  with  parameters  (a,  ft).  The  probability  of  Yfc  =  yfc  is  given  in  (7). 
The  intraclass  odds  ratio  r  is  (a  +  1  )((3  +  1  )/(a/3)  and  p  =  Pr (Yj  =  1)  = 
a/(a  +  P). 

We  set  the  distribution  of  family  sizes  as  uniform  for  k  =  2, 3, 4, 5  and 
vary  the  intraclass  dependence  at  four  different  values  of  p:  0.05,  0.20,  0.35 
and  0.50.  When  p  is  above  0.5,  it  generates  the  same  results  as  1  —  p.  In  Fig¬ 
ures  3,  we  plot  the  efficiency  under  three  situations:  unweighted,  weighted  by 
optimal  weights  and  weighted  by  family  size  (referred  to  as  simple  weights), 
The  plots  show  that  the  unweighted  estimator  (solid  line)  loses  efficiency 
when  the  dependence  increases;  whereas  its  efficiency  does  not  seem  affected 
much  by  the  affected  rate.  The  optimal  weights  (dotted  line)  greatly  enhance 
the  efficiency,  particularly  when  the  dependence  is  high.  Except  for  the  case 
of  p  —  0.05,  its  efficiency  is  always  above  0.95.  Using  the  simple  weights 
is  worse  than  no  weights  when  the  dependence  is  weak,  however,  the  per¬ 
formance  of  this  estimator  improves  when  the  dependence  increases  and  is 
almost  as  good  as  the  optimally  weighted  estimator  for  strong  dependence. 
The  plots  also  show  that  its  efficiency  climbs  faster  when  p  is  closer  to  0.5. 

[Figure  3  about  here.] 

We  provide  another  example  based  on  the  multivariate  probit  model  in 
Zhao  (2004). 
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We  also  investigate  how  the  distribution  of  family  sizes  affects  the  efficien¬ 
cies  of  the  three  estimators  based  on  the  multivariate  binary  beta-binomial 
model.  We  fix  p  at  0.5  and  the  intraclass  odds  ratio  at  a  low  level  (1.44)  and 
a  high  level  (9).  We  consider  only  two  different  family  sizes:  2  and  6  and 
take  the  distribution  of  family  sizes  with  f2  +  f6  —  1,  where  /2  and  /6  are  the 
proportions  of  families  of  size  2  and  6,  respectively.  In  Figure  4  the  efficiency 
is  plotted  against  /2.  In  both  case,  the  unweighted  estimator  is  least  effi¬ 
cient  when  /2  =  15/16,  that  is,  when  each  type  of  family  contributes  equal 
number  of  pairs  to  the  contingency  table.  The  efficiency  of  the  estimator 
with  optimal  weights  increases  monotonically  with  the  proportion  /2.  When 
the  odds  ratio  is  small,  the  estimator  with  simple  weights  is  least  efficient 
when  /2  =  6/7,  that  is,  each  type  of  family  contributes  an  equal  number  of 
individuals  to  the  sample,  whereas  the  efficiency  of  this  estimator  is  almost 
coincident  with  that  of  the  optimal  weights  when  the  odds  ratio  is  high. 

Moreover,  the  efficiency  of  both  the  estimators,  unweighted  or  weighted 
by  family  size,  decreases  almost  linearly  with  the  variance-mean  ratio  of  the 
number  of  pairs  contributed  by  each  family. 

[Figure  4  about  here.] 

Lastly  we  give  some  considerations  on  using  weights.  When  the  depen¬ 
dence  among  family  members  is  weak,  the  loss  of  efficiency  is  modest  if  the 
unweighted  estimator  is  used,  therefore,  there  is  no  compelling  reason  to  use 
any  weights.  On  the  contrary,  when  the  dependence  is  very  strong  and  there 
is  a  great  deal  of  variation  in  family  size,  the  simple  weights  are  recommended 
since  they  work  almost  as  well  as  the  optimal  weights  and  are  easy  to  use. 
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3.6  An  example 

In  this  subsection,  we  estimate  the  sib-sib  odds  ratio  for  the  presence  of 
intertriginous  freckling  in  NF1  patients.  There  are  193  families  with  at  least 
two  siblings.  Among  them  there  are  151  families  with  two  siblings,  30  with 
three  and  11  with  four.  There  is  one  large  family  with  eight  siblings,  all 
affected.  In  total,  there  are  335  sibling  pairs. 

We  compute  both  the  unweighted  and  simply  weighted  estimates  of  the 
sib-sib  log  odds  ratio.  Theoretically,  the  standard  errors  of  these  two  esti¬ 
mates  can  be  compute  based  on  (15)  and  (17)  if  we  can  estimate  P{j |3  and 
Pij\4,  the  probabilities  of  two  pairs  falling  in  respective  categories.  However, 
to  obtain  reliable  estimates  of  these  probabilities,  usually  a  large  number  of 
families  with  size  over  two  is  needed.  In  this  dataset,  there  are  not  enough 
families  to  produce  the  estimates.  Therefore,  we  use  the  jackknife  method  to 
obtain  the  standard  errors.  The  results  are  reported  in  Table  3. 

[Table  3  about  here.] 

After  weighting,  there  is  a  slight  decrease  in  the  estimate  of  Pi  and  a  slight 
increase  in  the  estimate  of  P4;  the  estimate  of  the  log  odds  ratio  has  a  smaller 
standard  error.  We  also  calculate  the  naive  standard  error  (0.314)  for  the 
unweighted  estimate,  which  is  more  than  10%  smaller  than  the  jackknife 
standard  error. 

4.  Conclusion 

We  have  proposed  simple  methods  to  estimate  the  intraclass  and  interclass 
odds  ratio  across  families  with  varying  sizes.  They  are  based  on  the  standard 
2x2  contingency  table  and  are  easy  to  carry  out.  We  derived  the  asymptotic 
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variance  of  the  odds  ratio  estimators  taking  into  account  the  dependence 
between  pairs  within  a  family.  The  usual  standard  error,  i.e.,  that  based  on 
the  asymptotic  variance  for  independent  pairs,  will  be  too  small  when  there  is 
positive  association  within  families;  hence,  it  will  lead  to  a  confidence  interval 
that  is  too  short. 
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Appendix  A 
Proof  of  equation  (6) 


The  term  uj  depends  on  type  I  pair  sets  in  which  the  two  pairs  share  the 
same  member  from  class  1.  Since  Plm jj  =  0  when  l  =  1,2  and  m  =  3,4  or 
when  l  =  3, 4  and  m  =  1,2, 


vr  =  V  ^ -  -  9  'Pl2|/  _  o^\L  -  V'  Si 
h  p?  P1P2  P3P4  ■“  Pi  ’ 


(A.1) 


where 


Cl 


u\i  nv\i 

Pi  Pv  ’ 


z'  = 


fl  if  Z  =  2, 

2  if  Z  =  1, 

3  if  Z  =  4, 

4  if  Z  =  3. 


(A.2) 


Let  X  be  the  indicator  of  the  class  1  member  and  Y\,  Y2  be  those  of  the 


two  class  2  members.  We  write  py. ]x  =  Pr(Ft  =  y,\X  =  x)  and  pyi2/2|x  = 
Pr(Yi  =  yi,Y2  =  y2\X  —  x ),  for  x,yi  and  y2  in  {0, 1}.  Then 


Pn\i  _  Pt(X  =  1,Y1  =  1,Y2  =  1)  pun 
Pi  Pr(A  =  l,Y2  =  l)  pip’ 

pi2\i  _  Pr(A  =  1,  Fi  =  1,  Y2  =  0)  _  piop  _  Pip  -  pup 
P2  Pr(A  =  l,F2  =  0)  ~  pop  "  l-pi|i  ' 

It  follows  that 

c  _  Plll1  ~p*|i 

Pi|i(l  —  Pi|i)  ’ 
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which  is  the  conditional  correlation  of  Y\  and  Y2  given  X  =  1.  It  can  be 
shown  that  c\  —  c2.  By  symmetry,  cs  =  c4  is  the  conditional  correlation  of 
Y\  and  Y  given  X  =  0.  Therefore,  vi  can  be  simplified  as  in  (6). 
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Figure  1.  Efficiency  of  712  based  on  a  beta-binomial  model  when  ai  =  (30 
a,a>o  =  fa  =  /3 
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Figure  2.  Efficiency  of  712  against  the  variance-mean  ratio  of  family  size 
(beta-binomial  model). 


5  10  15  20  25  5  10  15  20  25 

intraclass  odds  ratio  intraclass  odds  ratio 


Figure  3.  Efficiency  comparison  of  7  estimators.  Solid  line:  no  weights,  dot¬ 
ted  line:  optimal  weights,  dashed  line:  simple  weights  (Multivariate  binary 
beta-binomial). 
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Figure  4.  Efficiency  comparison  of  7  estimators  with  varying  proportions 
of  families  with  size  2  and  6.  Solid  line:  no  weights,  dotted  line  with 
optimal  weights,  dashed  line:  simple  weights.  In  the  figure  on  the  right  hand 
side,  the  dotted  line  with  “+”  and  the  dashed  line  are  on  top  of  each  other. 
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Y 

1  0 


Ox 

O2 

Oz 

O4 

Table  1 

Contingency  table  formed  by  independent  pairs 
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log  odds  ratio 

s.e. 

naive  s.e. 

jackknife  s.e. 

0.368 

0.402 

0.377 

0.387 

Table  2 

Estimate  of  parent-offspring  log  odds  ratio  for  the  presence  of  intertriginous 

freckling  in  NF1  patients. 
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with  no  weights 

with  simple  weights 

Pi  (both  affected) 

0.704 

0.668 

P4  (both  unaffected) 

0.075 

0.096 

7  (log  odds  ratio) 

1.461 

1.531 

Jackknife  standard  error  of  7 

0.357 

0.348 

Table  3 

Estimates  of  sib-sib  log  odds  ratio  for  the  presence  of  intertriginous  freckling 

in  NF1  patients. 
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Predictors  of  the  Risk  of  Mortality  in  Neurofibromatosis  2 
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Hospital,  and  department  of  Otolaryngology,  Manchester  Royal  Infirmary,  Manchester,  United  Kingdom 


To  evaluate  clinical  and  molecular  predictors  of  the  risk  of  mortality  in  people  with  neurofibromatosis  2  (NF2), 
we  analyzed  the  mortality  experience  of  368  patients  from  261  families  in  the  United  Kingdom  NF2  registry,  using 
the  Cox  proportional-hazards  model  and  the  jackknife  method.  Age  at  diagnosis,  intracranial  meningiomas,  and 
type  of  treatment  center  were  informative  predictors  of  the  risk  of  mortality.  In  Cox  models,  the  relative  risk  of 
mortality  increased  1.13-fold  per  year  decrease  in  age  at  diagnosis  (95%  confidence  interval  [Cl]  1.08-1.18)  and 
was  2.51-fold  greater  in  people  with  meningiomas  compared  with  those  without  meningiomas  (95%  Cl  1.38- 
4.57).  The  relative  risk  of  mortality  in  patients  treated  at  specialty  centers  was  0.34  compared  with  those  treated 
at  nonspecialty  centers  (95%  Cl  0.12-0.98).  In  a  separate  model,  the  relative  risk  of  mortality  in  people  with 
constitutional  NF2  missense  mutations  was  very  low  compared  with  those  with  other  types  of  mutations  (nonsense 
or  frameshift  mutations,  splice-site  mutations,  and  large  deletions),  but  the  Cl  could  not  be  well  quantified  because 
there  was  only  one  death  among  people  with  missense  mutations.  We  conclude  that  age  at  diagnosis,  the  strongest 
single  predictor  of  the  risk  of  mortality,  is  a  useful  index  for  patient  counseling  and  clinical  management  (as  are 
intracranial  meningiomas).  To  ensure  optimal  care,  we  recommend  that  people  with  NF2  be  referred  to  specialty 
treatment  centers. 


Introduction 

Neurofibromatosis  2  (NF2  [MIM  101000])  is  an  auto¬ 
somal  dominant  disorder  that  is  caused  by  inactivating 
mutations  or  loss  of  both  alleles  of  the  NF2  tumor-sup- 
pressor  gene  (Rouleau  et  al.  1993;  Trofatter  et  al.  1993). 
Vestibular  schwannomas  (VSs),  intracranial  menin¬ 
giomas,  spinal  tumors,  peripheral  nerve  tumors,  and  pre- 
.  senile  lens  opacities  are  common  in  NF2  (Evans  et  al. 
1992 a;  Parry  et  al.  1994;  Mautner  et  al.  1996).  VSs 
occur  in  —95%  of  adults  with  NF2  (bilateral  VSs  are 
pathognomonic  for  NF2),  meningiomas  in  -50%,  and 
presenile  lens  opacities  in  — 60%-80%.  Spinal  tumors 
occur  in  -90%  of  people  with  NF2,  although  only  30% 
of  these  people  have  symptomatic  spinal  tumors  (Maut¬ 
ner  et  al.  1995). 

Cross-sectional  studies  of  genotype-phenotype  cor¬ 
relations  in  NF2  have  found  that,  in  general,  consti¬ 
tutional  nonsense  and  frameshift  NF2  mutations  are 
associated  with  severe  disease;  missense  mutations,  in- 
frame  deletions,  and  large  deletions  with  mild  disease; 
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and  splice-site  mutations  are  associated  with  variable 
disease  severity  (Merel  et  al.  1995;  Kluwe  et  al.  1996, 
1998;  Parry  et  al.  1996;  Ruttledge  et  al.  1996;  Evans 
et  al.  1998#).  There  have  been  few  longitudinal  studies 
of  NF2,  because  of  the  rarity  of  the  disease  (Evans  et 
al.  1992 b;  Antinheimo  et  al.  2000).  In  two  recent  short¬ 
term  longitudinal  studies  of  the  predictors  of  VS  growth 
rates  in  NF2,  VS  growth  rates  tended  to  be  higher  in 
people  with  a  younger  age  at  onset  or  diagnosis  of  NF2, 
but  they  were  highly  variable,  even  among  affected 
relatives  of  similar  ages  in  a  single  family  (Baser  et  al. 
2002;  Mautner  et  al.  2002).  As  yet,  there  have  not  been 
any  longitudinal  studies  of  other  common  tumors  in 
NF2,  such  as  intracranial  meningiomas  or  spinal  tu¬ 
mors,  which  also  cause  considerable  morbidity  (Evans 
et  al.  2000). 

NF2  is  a  chronic  disease  in  which  life  expectancy, 
although  often  shortened,  is  lengthy.  Short-term  studies 
of  the  growth  rates  of  NF2-associated  tumors,  especially 
studies  of  a  single  tumor  type,  do  not  reflect  the  total 
disease  burden  of  NF2  or  the  efficacy  of  treatment  as 
well  as  do  long-term  studies  that  utilize  a  more  inclusive 
measure  of  health,  such  as  mortality.  Evans  et  al.  (1992#) 
found  that  mean  actuarial  survival  in  people  with  NF2 
was  62  years,  and  Parry  et  al.  (1994)  reported  that  broad 
categories  of  NF2  disease  severity  (mild,  intermediate, 
and  severe)  were  correlated  with  age  at  death.  Neither 
of  these  studies  evaluated  specific  clinical  or  molecular 
characteristics  as  potential  predictors  of  the  risk  of  mor- 
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tality.  In  the  present  study,  we  evaluated  clinical  and 
molecular  predictors  of  the  risk  of  mortality  in  a  large 
series  of  people  with  NF2. 

Subjects  and  Methods 


Patient  Population 

The  United  Kingdom  NF2  registry  is  based  in  the 
Department  of  Medical  Genetics,  St.  Mary’s  Hospital, 
Manchester.  People  are  ascertained  by  contacting  neu¬ 
rosurgeons,  otolaryngologists,  neurologists,  pediatri¬ 
cians,  dermatologists,  and  geneticists  throughout  the 
United  Kingdom,  and  the  collection  is  augmented  in 
the  North  West  Region  by  the  Regional  Cancer  Reg¬ 
istry.  The  present  study  was  subject  to  continuing 
ethics  committee  evaluation,  and  participants  gave  in¬ 
formed  consent. 

As  of  February  15,  2002,  the  registry  had  425  people 
from  282  families.  For  the  present  study,  we  excluded 
three  groups  of  people:  (1)  We  excluded  known  somatic 
mosaics  (N  =  17;  shown  to  be  mosaic  at  the  molecular 
level).  Almost  all  reported  NF2  somatic  mosaics  have 
mild  disease,  despite  having  nonsense  or  frameshift  mu¬ 
tations  that  typically  cause  severe  disease  in  classical 
NF2  (Evans  et  al.  1998b;  Kluwe  and  Mautner  1998). 
(2)  We  excluded  people  who  were  born  before  1930 
(N  =  34).  All  such  individuals  in  the  United  Kingdom 
NF2  registry  were  identified  through  younger  affected 
relatives.  The  pre-1930  group  was  excluded  because  it 
did  not  meet  the  proportional-hazards  assumption  for 
the  Cox  analysis.  Specifically,  type  of  treatment  center 
and  meningiomas  did  not  predict  the  risk  of  mortality 
in  people  who  were  born  before  1930,  in  contrast  to 
those  born  after  1930.  (3)  We  excluded  people  with  miss¬ 
ing  information  on  either  age  at  diagnosis  ( N  —  2)  or 
presence  of  intracranial  meningiomas  (N  =  4),  two 
covariates  that  are  necessary  for  analysis  of  the  risk 
of  mortality. 

The  resultant  study  group  had  368  people  from  261 
families,  all  of  whom  met  the  Manchester  clinical  di¬ 
agnostic  criteria  for  NF2  (Evans  et  al.  1992a)  or  had 
identified  constitutional  NF2  mutations.  The  distribu¬ 
tion  of  the  number  of  affected  family  members  was  206 
families  with  one  affected  member,  27  families  with  two 
affected  members,  15  families  with  three  affected  mem¬ 
bers,  7  families  with  four  affected  members,  3  families 
with  five  affected  members,  1  family  with  six  affected 
members,  and  2  families  with  seven  affected  members. 
There  were  43  two-generation  families  and  9  three-gen¬ 
eration  families.  There  were  223  people  with  new  mu¬ 
tations  (individuals  with  sporadic  disease  and  founders) 
and  145  people  with  inherited  mutations. 


NF2  Mutation  Analysis 

Genomic  DNA  samples  prepared  from  peripheral 
leukocytes  were  amplified  with  primers  for  all  17  exons 
of  the  NF2  gene,  and  screening  was  performed  for  con¬ 
stitutional  NF2  mutations,  using  SSCP  analysis,  as  de¬ 
scribed  elsewhere  (Evans  et  al.  1998 a). 

Statistical  Analysis 

Potential  predictors  of  the  risk  of  mortality  were  first 
assessed  using  univariate  Kaplan-Meier  survival  curves. 
The  covariates  examined  were  age  at  onset  of  symptoms, 
age  at  diagnosis,  sex,  type  of  constitutional  NF2  mutation, 
inheritance  (new  mutation  or  inherited  case),  generation 
(in  the  52  multigeneration  families),  presence  and  number 
of  each  type  of  NF2  nervous  system  tumor  (VSs,  intra¬ 
cranial  meningiomas,  spinal  tumors,  and  peripheral  nerve 
tumors),  lens  opacities,  number  of  surgical  operations, 
calendar  year  of  diagnosis,  and  type  of  treatment  center 
(specialty  or  nonspecialty).  The  specialty  treatment  cen¬ 
ters  were  hospitals  with  NF2  specialist  management  teams 
(Manchester  Royal  Infirmary,  Addenbrooke’s  Hospital 
[Cambridge],  and  Royal  London  Hospital). 

The  number  of  tumors  at  diagnosis  was  used  because 
data  from  serial  examinations  were  not  routinely  avail¬ 
able.  A  potential  concern  about  this  choice  is  that  im¬ 
aging  done  prior  to  the  late  1980s  (i.e.,  with  comput¬ 
erized  tomography)  may  be  of  poorer  quality  and  may 
tend  to  underestimate  the  number  of  tumors,  in  com¬ 
parison  with  imaging  done  more  recently  (i.e.,  with  mag¬ 
netic  resonance  imaging).  Several  lines  of  evidence  in¬ 
dicate  that  this  potential  bias  is  negligible  in  people  with 
meningiomas  or  spinal  tumors.  The  number  of  menin¬ 
giomas  did  not  vary  significantly  by  calendar  year  of 
diagnosis  (P  =  .95),  after  adjustment,  using  linear  re¬ 
gression,  for  age  at  diagnosis  of  the  first  intracranial 
meningioma.  Some  people  present  with  VSs  but  later 
develop  intracranial  meningiomas  or  have  meningiomas 
that  may  not  have  been  detected  at  presentation  because 
of  suboptimal  imaging.  Of  the  136  people  with  both  VSs 
and  meningiomas,  25%  were  diagnosed  with  menin¬ 
giomas  after  their  VSs  were  diagnosed,  and  the  median 
time  between  detection  of  VSs  and  meningiomas  in  these 
people  was  only  four  years.  Some  people  may  not  have 
spinal  imaging  early  in  the  course  of  their  disease,  but 
the  median  time  between  initial  assessment  and  diag¬ 
nosis  of  spinal  tumors  was  only  one  year. 

The  presentation  of  NF2  is  different  in  adults  than  in 
children,  whose  initial  sign  or  symptom  is  often  unre¬ 
lated  to  VSs  (Mautner  et  al.  1993;  MacCollin  and  Maut¬ 
ner  1998).  For  this  reason,  interactions  between  age  at 
diagnosis  and  number  of  each  type  of  nervous  system 
tumor  were  evaluated  in  the  Cox  models;  none  of  these 
interactions  were  statistically  significant.  Age  at  onset  of 
symptoms  and  age  at  diagnosis  were  highly  correlated 
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Table  1 


Characteristics  of  368  People  with  NF2  in  the  Study  Population 


Characteristic 

No. 

% 

Vital  status:8 

Alive 

295 

80.2 

Dead 

73 

19.8 

Sex: 

Female 

191 

51.9 

Male 

177 

48.1 

Inheritance: 

New  mutation 

223 

60.6 

Inherited  mutation 

145 

39.4 

Age  at  onset  of  symptoms  (years): 

1-19 

153 

41.8 

20-39 

155 

42.3 

2*40 

34 

9.3 

Asymptomatic 

24 

6.6 

Age  at  diagnosis  (years): 

1-19 

116 

31.5 

20-39 

186 

50.6 

5*40 

66 

17.9 

VS: 

None 

23 

6.3 

Unilateral 

33 

9.0 

Bilateral 

310 

84.7 

Intracranial  meningiomas: 

Absent 

203 

55.2 

Present 

165 

44.8 

Type  of  constitutional  NF2  mutation:b 

Nonsense 

43 

13.5 

Frameshift  deletion 

27 

8.5 

Frameshift  insertion 

10 

3.1 

Splice  donor  site 

19 

6.0 

Splice  acceptor  site 

36 

11.3 

Missense 

22 

6.9 

Large  deletion 

47 

14.8 

Not  identified 

114 

35.8 

Type  of  treatment  center: 

Nonspecialty 

259 

70.4 

Specialty 

109 

29.6 

a  As  of  February  15,  2002. 

b  Frequency  of  mutations  in  318  people  from  216  families  that  were 
screened  for  constitutional  NF2  mutations  (molecular  model). 


(r2  =  .63,  P<  .001);  age  at  diagnosis  was  used  in  the 
analysis  because  tumor  burden  was  usually  first  evalu¬ 
ated  at  this  time.  Age  at  diagnosis  and  calendar  year  of 
diagnosis  were  not  highly  correlated  (r2  =  .01,  P  = 
.15).  Initial  data  analyses  included  some  separate  anal¬ 
yses  for  people  with  new  mutations  and  those  with  in¬ 
herited  disease,  but  the  two  groups  were  similar  in  sur¬ 
vival  rates  (as  shown  by  Kaplan-Meier  curves),  so  people 
with  new  mutations  and  those  with  inherited  disease 
were  combined  in  further  analyses.  Proband/nonpro- 
band  status  and  new  mutation/inherited  mutation  status 
were  highly  correlated,  so  proband  status  was  not  con¬ 
sidered  further  as  a  covariate. 

The  covariates  that  were  most  highly  associated  with 
the  risk  of  mortality  in  single-predictor  Cox  propor¬ 
tional  hazards  models  were  added  sequentially  to  a  mul¬ 


tiple-predictor  Cox  model  until  there  was  only  a  minor 
decrease  in  the  log  partial  likelihood.  The  log  partial 
likelihood,  an  output-summary  value  of  Cox  regression 
analysis,  using  S-Plus  (Venables  and  Ripley  1997,  is  con¬ 
sidered  to  be  an  indicator  of  how  well  a  set  of  potential 
predictors  explain  the  variation  in  survival  times;  be¬ 
cause  of  the  dependence  within  families,  it  is  not  the 
actual  partial  likelihood.  We  constructed  two  Cox  mod¬ 
els,  one  without  and  one  with  information  on  the  type 
of  constitutional  NF2  mutation.  The  resulting  two  mod¬ 
els  are  called  the  “clinical  model”  and  the  “molecular 
model.”  The  mutation  type  co variate  is  categorical  and 
is  coded  as  several  binary  variables.  These  variables  were 
indicators  of  splice-site  mutations,  missense  mutations, 
large  deletions,  and  unidentified  mutations,  each  com¬ 
pared  with  nonsense  or  frameshift  mutations.  The  cat¬ 
egory  of  unidentified  mutations  can  include  different 
types  of  mutations  but  was  treated  as  a  single  category 
for  the  purpose  of  this  analysis.  The  molecular  model 
excluded  46  people  who  had  not  been  screened  for  con¬ 
stitutional  NF2  mutations,  2  people  with  in-frame  de¬ 
letions,  and  2  people  with  chromosomal  translocations. 

The  Cox  model  assumes  independence  of  families  and 
independence  of  members  within  families.  The  latter  part 
of  this  assumption  is  violated  when  there  is  correlation 
between  family  members.  When  the  data  are  positively 
correlated,  the  SEs  of  parameters  estimated  under  the 
assumption  of  independence  will  tend  to  be  too  small, 
which  could  lead  to  an  erroneous  conclusion  that  an 


Figure  1  Kaplan-Meier  survival  curve  (log-rank  test):  age  at 
diagnosis  (P<  .0001). 
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Age  (years) 


Figure  2  Kaplan-Meier  survival  curve  (log-rank  test):  intra¬ 
cranial  meningiomas  (?  =  .0003), 

effect  is  important.  To  surmount  this  problem  for  the 
Cox  models,  the  delete-one  jackknife  method,  with  fam¬ 
ily  as  the  unit,  was  used  to  determine  the  adjusted  pa¬ 
rameter  estimates  and  their  SEs  in  the  Cox  model.  The 
jackknife  is  a  standard  statistical  method  that  is  com¬ 
monly  used  to  correct  for  bias  in  parameter  estimates 
and  to  provide  SEs  in  nonstandard  sampling  situations 
(Mosteller  and  Tukey  1977). 

In  addition  to  fitting  Cox  models,  we  fitted  log-normal 
regression  models  (regressing  log  survival  on  sets  of  pre¬ 
dictor  variables),  using  the  S-Plus  function  called  “surv- 
reg.”  The  results  are  qualitatively  the  same  as  those  of 
the  Cox  model.  To  assess  the  amount  of  intrafamilial 
correlation,  p,  the  simplest  approach  is  to  add  familial 
dependence  to  the  log-normal  regression  model  and  use 
a  multivariate  log-normal  distribution  for  the  vector  of 
log  survival  for  each  family.  The  simplest  dependence 
structure  is  exchangeable  dependence;  this  assumes  a 
common  correlation  for  each  pair  of  family  members. 
This  dependence  obtains  from  a  random-effects  model 
with  a  common  family  effect  plus  individual  effects. 
In  notation,  the  model  is  log  =  jft0  +  0 fxt]  +  Fx  +  Eij? 
where  i  is  the  index  for  families,  j  is  the  index  for  mem¬ 
bers  within  a  family,  Y  is  the  survival  time,  xx]  is  a  vector 
of  co variates,  FjS  are  independent  family  effects  that  have 
a  normal  distribution  with  mean  0  and  variance  <Xp,  EMs 
are  independent  individual  effects  that  have  a  normal 
distribution  with  mean  0  and  variance  a f.  The  ratio  of 
variances  between  and  within  families  is  0  =  Up/af,  and 
the  intrafamilial  correlation  isp  =  61(1  +  6).  For  a  fam¬ 


ily  with  left-censored  Yij5  the  likelihood  contribution  in¬ 
volves  a  multivariate  normal-rectangle  probability;  for 
the  random-effects  model  given  above,  the  rectangle 
probability  can  be  computed  as  a  one-dimensional  nu¬ 
merical  integral.  The  log-likelihood  can  be  numerically 
maximized  with  a  quasi-Newton  routine,  to  obtain  max¬ 
imum-likelihood  estimates  of  the  parameters. 

Results 

The  characteristics  of  the  study  population  are  presented 
in  table  1.  The  mean  ±  SE  age  at  onset  of  symptoms 
was  22  ±  1  years,  and  the  mean  age  at  diagnosis  was 
27  ±  1  years.  The  median  length  of  follow-up  from 
initial  clinical  evaluation  was  7  years  (range  0-37  years). 
Ninety-eight  percent  of  people  were  diagnosed  after 
1970.  Ninety-four  percent  of  people  had  VSs,  and  45% 
had  intracranial  meningiomas.  Age  at  diagnosis  did  not 
vary  significantly  by  meningioma  status  (presence  or  ab¬ 
sence  of  intracranial  meningiomas)  (mean  ±  SE  28  ± 
1  years  with  meningiomas  absent,  27  ±  1  years  with 
meningiomas  present).  Constitutional  NF2  mutations 
were  identified  in  120  (56%)  of  the  216  families  that 
were  screened  for  mutations.  Seventy-four  (20%)  of  the 
368  people  died  during  follow-up:  51  of  tumor  burden, 
14  of  complications  in  the  immediate  postoperative  pe¬ 
riod,  3  of  malignancies  arising  from  an  NF2-associated 
tumor,  2  each  of  traffic  accidents  and  suicide,  and  1  each 
of  a  fall  due  to  NF2-associated  imbalance  and  a  myo¬ 
cardial  infarction. 


Figure  3  Kaplan-Meier  survival  curve  (log-rank  test):  type 
of  constitutional  NF2  mutation  (P  =  .0020). 
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Figure  4  Kaplan-Meier  survival  curve  (log-rank  test):  type 
of  treatment  center  (P  =  .0001). 

In  single-predictor  Cox  models,  five  covariates  were 
at  least  moderately  associated  with  the  risk  of  mortality: 
age  at  diagnosis,  intracranial  meningioma  status,  type 
of  constitutional  NF2  mutation,  type  of  treatment  cen¬ 
ter^  and  calendar  year  of  diagnosis.  Kaplan-Meier  sur¬ 
vival  curves  are  presented  in  figures  1-4.  These  covar¬ 
iates  were  included  sequentially  in  multiple-predictor 
Cox  models. 

In  the  clinical  model,  the  best  multiple-predictor 
Cox  model  had  three  covariates,  which  were,  in  order 
of  importance,  age  at  diagnosis,  type  of  treatment 
center,  and  meningioma  status.  The  relative  risk  (RR) 
of  mortality  increased  1.13-fold  per  year  decrease  in 
age  at  diagnosis  (95%  Cl  1.08-1.18)  and  was  2.51-fold 
greater  in  people  with  meningiomas  compared  with 
those  without  meningiomas  (95%  Cl  1.38-4.57).  The 
RR  of  mortality  in  people  treated  at  specialty  centers 
was  0.34  compared  with  those  treated  at  nonspecialty 
centers  (95%  Cl  0.12-0.98).  The  coefficients  for  these 
covariates  were  stable  across  Cox  models  with  different 
sets  of  predictors.  See  table  2  for  the  summary  of  this 
Cox  model;  SEs  were  based  on  the  jackknife  method 

Table  2 


with  the  family  as  the  sampling  unit.  Calendar  year  of 
diagnosis  had  little  additional  predictive  value  after  age 
at  diagnosis  was  included.  In  a  separate  model  that  in¬ 
cluded  the  covariates  of  age  at  diagnosis  and  type  of 
treatment  center  but  substituted  number  of  menin¬ 
giomas  for  presence  of  meningiomas,  the  RR  of  mor¬ 
tality  did  not  significantly  increase  with  increasing  num¬ 
ber  of  meningiomas;  however,  each  group  had  relatively 
few  people  (61  people  with  one  meningioma,  63  with 
two  or  three  meningiomas,  and  35  with  more  than  three 
meningiomas). 

In  the  molecular  model,  the  best  multiple-predictor 
Cox  model  had  three  covariates,  which  were,  in  order 
of  importance,  age  at  diagnosis,  type  of  treatment  cen¬ 
ter,  and  mutation  type.  Meningioma  status  had  little 
additional  predictive  value  after  these  three  variables 
were  in  the  model.  See  table  3  for  a  summary  of  this 
Cox  model;  the  /3s  for  the  four  mutation  types  are  rel¬ 
ative  to  nonsense  or  frameshift  mutations.  The  main 
conclusion  is  that  people  with  missense  mutations  have 
a  much  lower  risk  of  mortality  than  do  people  with  any 
of  the  other  types  of  mutations.  This  can  be  seen  from 
the  number  of  deaths  among  people  with  the  five  types 
of  mutations:  15  of  80  for  nonsense  or  frameshift  mu¬ 
tations,  15  of  55  for  splice-site  mutations,  9  of  47  for 
large  deletions,  19  of  114  for  unidentified  mutations, 
and  1  of  22  for  missense  mutations.  Because  there  is 
only  one  death  in  the  missense  mutation  group,  the 
jackknife  does  not  provide  a  reliable  SE  estimate  for  /3 
for  the  missense  mutation  group;  the  data  in  table  3  for 
this  variable  are  based  on  the  omission  of  two  jackknife 
outliers. 

For  the  random-effects  model  for  log  survival  time, 
the  covariates  of  age  at  diagnosis,  type  of  treatment 
center,  and  presence  of  meningiomas  were  fitted.  The 
summary  of  the  parameter  estimates  is  given  in  table 
4.  The  estimated  intrafamilial  correlation  is  0.56  = 
1.257(1  +  1.25),  with  a  wide  95%  Cl  of  0.0-0.73.  The 
data  set  is  small  and  has  a  high  censoring  rate,  so  the 
intrafamilial  correlation  cannot  be  estimated  precisely. 
However,  the  random-effects  model  appears  to  adapt 
well  to  censored  data. 

Discussion 

In  the  present  study,  we  found  that  four  covariates  were 
informative  predictors  of  the  risk  of  mortality  in  people 


Parameter  Estimates  and  SEs  for  the  Clinical  Model  (261  Families  with  NF2) 

Variable  0  SE*  RR  95%  Cl^ 

Age  at  diagnosis  (per  year  decrease)  .12  .02  1.13  1.08-1.18 

Intracranial  meningiomas  (present  vs.  absent)  .92  .31  2.51  1.38-4.57 

Type  of  treatment  center  (specialty  vs.  nonspecialty)  —1,07  .54  .34  .12-.98 
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Table  3 


Parameter  Estimates  and  SEs  for  the  Molecular  Model  (216  Families  with  NF2) 


Variable 

0 

SE, 

RR 

95%  CIrr 

Age  at  diagnosis  (per  year  decrease) 

.14 

.03 

1.15 

1.09-1.21 

Type  of  treatment  center  (specialty  vs.  nonspecialty) 

Type  of  constitutional  NF2  mutation  (vs.  nonsense  or  frameshift): 

-1.21 

.63 

.30 

.08-1.05 

Splice-site 

.22 

.67 

1.24 

.33-4.70 

Missense 

-2.49 

.51 

.08 

.03-.23 

Large  deletion 

-.29 

.49 

.75 

.28-2.00 

Unidentified 

-.31 

.51 

.73 

.26-2.00 

with  NF2.  The  risk  df  mortality  increased  with  decreasing 
age  at  diagnosis  and  was  greater  in  people  with  intracra¬ 
nial  meningiomas  compared  with  those  without  menin¬ 
giomas.  The  risk  of  mortality  was  lower  in  people  with 
constitutional  NF2  missense  mutations  than  in  those  with 
other  types  of  mutations  and  in  people  who  were  treated 
at  specialty  centers  compared  with  those  who  were  treated 
at  nonspecialty  centers.  The  simplicity  of  age  at  diagnosis, 
by  far  the  strongest  single  predictor  of  risk  of  mortality 
in  NF2,  makes  it  a  useful  index  for  patient  counseling 
and  clinical  management;  intracranial  meningiomas  are 
also  useful  in  this  regard.  A  possible  reason  for  the  higher 
risk  of  mortality  in  people  with  NF2  who  are  diagnosed 
at  younger  ages  is  that  tumor  growth  is  generally  more 
rapid  in  these  patients  (Baser  et  al.  2002;  Mautner  et  al. 
2002),  perhaps  because  of  a  higher  rate  of  somatic  cell 
growth  or  a  larger  proportion  of  growing  cells  in  young 
people.  Age  at  onset  (which  is  highly  correlated  with  age 
at  diagnosis)  and  number  of  non-VS  intracranial  tumors 
are  key  indices  of  NF2  disease  severity  (Parry  et  al.  1994). 
Other  covariates,  such  as  sex,  inheritance,  and  year  of 
diagnosis,  were  not  as  useful  predictors  of  the  risk  of 
mortality.  However,  as  more  information  is  collected  on 
people  with  NF2  in  the  future,  the  assessment  of  the  risk 
of  mortality  should  be  repeated  with  these  covariates, 
since  it  is  possible  that  the  present  data  are  insufficient  to 
detect  some  associations. 

In  the  clinical  model,  the  empirical  importance  of 
the  type  of  treatment  center  is  comparable  to  that  of 
meningioma  status.  The  effect  of  treatment  in  specialty 
centers  is  not  due  to  marked  differences  in  the  char¬ 
acteristics  of  people  who  were  seen  at  specialty  and 
nonspecialty  treatment  centers.  New  mutations  in  these 
two  groups  were  similar  with  respect  to  age  at  onset  of 
symptoms,  age  at  diagnosis,  and  prevalence  of  menin¬ 
giomas.  Similar  proportions  of  people  with  inherited 
mutations  were  treated  at  specialty  and  nonspecialty 
centers. 

In  all  likelihood,  a  major  cause  of  the  lower  risk  of 
mortality  in  people  with  NF2  who  are  treated  in  spe¬ 
cialty  centers  is  the  increasing  rate  of  favorable  opera¬ 
tive  outcomes  and  the  decreasing  rate  of  postoperative 
complications  with  increasing  surgical  experience. 


When  new  surgical  teams  are  being  trained  for  VS  sur¬ 
gery,  there  are  clear  trends,  as  the  number  of  surgeries 
increases,  toward  improved  postoperative  preservation 
of  facial  nerve  function,  complete  resection  rate,  and 
hearing  preservation,  as  well  as  a  lowered  incidence  of 
serious  complications,  such  as  cerebrospinal  fluid  leaks 
(Buchman  et  al.  1996;  Welling  et  al.  1999).  In  Denmark, 
decentralized  VS  neurosurgery  was  associated  with  very 
high  rates  of  perioperative  mortality  (8.5%)  and  serious 
surgical  complications  (35.6%)  (Charabi  et  al.  1992). 
People  with  benign  meningioma  who  are  treated  in  ac¬ 
ademic  hospitals  have  significantly  lower  mortality,  af¬ 
ter  adjustment  for  other  risk  factors,  than  do  those  who 
are  treated  in  community  hospitals  (McCarthy  et  al. 
1998).  In  addition  to  more  extensive  surgical  experi¬ 
ence,  specialty  centers  have  coordinated  expertise  in  the 
multiple  clinical  specialties  that  are  needed  to  properly 
diagnose  NF2  and  to  treat  affected  individuals  and  their 
at-risk  family  members  (Evans  et  al.  1993;  Jackler  1998). 
To  ensure  optimal  care,  we  recommend  that  people  with 
NF2  be  referred  to  specialty  centers. 

Almost  all  of  the  people  in  this  study  were  diagnosed 
after  1970.  During  the  last  three  decades,  there  have 
been  considerable  improvements  in  neuroimaging  tech¬ 
niques  that  have  permitted  earlier  detection  of  small 
tumors.  In  combination  with  improvements  in  neuro¬ 
surgical  treatments,  this  has  led  to  better  clinical  man¬ 
agement  and  a  much  greater  incentive  to  diagnose  VSs 
and  NF2.  The  noninclusion  of  the  more  recent  years  of 
diagnosis  in  the  multiple-predictor  Cox  models  does  not 
suggest  that  improvements  in  clinical  care  lack  benefit. 


Table  4 

Parameter  Estimates  and  SEs  for  the  Random-Effects  Mode!  for  Log 
Survival  (261  Families  with  NF2) 


Variable 

Estimate 

SE 

<rE 

.18 

.03 

0 

1.25 

.73 

0o 

3.15 

.06 

Age  at  diagnosis  (per  year  increase) 

-.23 

.02 

Intracranial  meningiomas  (present  vs.  absent) 

-.07 

.04 

Type  of  treatment  center  (specialty  vs.  nonspecialty) 

.21 

.06 
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In  all  likelihood,  insufficient  time  has  elapsed  for  such 
benefit  to  be  reflected  in  decreased  mortality.  In  ad¬ 
dition,  the  benefits  from  incremental  improvements 
in  clinical  care  throughout  the  post-1970  era  may  be 
more  subtle,  with  respect  to  risk  of  mortality,  than  those 
that  occurred  with  the  advent  of  computerized  tomog¬ 
raphy  scanning  in  the  early  1970s. 

Of  the  people  with  NF2  who  have  access  to  treatment, 
relatively  few  die  of  their  VSs.  In  the  modern  era  of 
improved  microsurgical  techniques,  operative  mortality 
in  specialized  neuro-otology  treatment  centers  is  ^1%, 
and  recurrence  rates  are  nil  when  the  entire  VS  and 
vestibular  nerves  are  excised.  Intracranial  meningiomas 
and  spinal  tumors  are  common  in  NF2,  and  these  re¬ 
current  tumors  often  require  repeated  surgeries.  There 
is  considerable  pre-  and  postoperative  morbidity  due  to 
seizures,  paralysis,  wasting,  pneumonia,  and  accidents 
associated  with  meningiomas  and  spinal  tumors  (Evans 
et  al.  2000). 

In  the  molecular  model,  the  risk  of  mortality  appeared 
to  be  much  lower  in  people  with  missense  mutations 
than  in  those  with  other  types  of  mutations.  Experi¬ 
mental  studies  have  suggested  possible  mechanisms 
through  which  constitutional  NF2  missense  mutations 
cause  milder  disease  than  do  nonsense  or  frameshift 
NF2  mutations.  Missense  mutations  produce  an  NF2 
protein  (termed  “merlin”  or  “schwannomin”)  that  is 
stable  but  defective  in  negative  growth  regulation, 
whereas  nonsense  mutations  do  not  produce  stable  pro¬ 
tein  (Gutmann  et  al.  1998).  Naturally  occurring  mis¬ 
sense  mutations  have  reduced,  but  not  absent,  activity 
of  the  merlin-binding  protein  /311-spectrin  (Scoles  et  al. 
1998). 

Since  NF2  mutation  type  is  strongly  associated  with 
age  at  diagnosis  in  cross-sectional  studies,  a  logical  ques¬ 
tion  is  why  age  at  diagnosis  is  more  strongly  associated 
than  the  type  of  NF2  mutation  with  the  risk  of  mor¬ 
tality.  Age  at  diagnosis  may  be  more  strongly  associated 
because  both  age  at  diagnosis  and  age  at  death  reflect 
a  composite  of  disease-influencing  factors,  whereas  mu¬ 
tation  type  is  only  one  of  these  factors.  Other  factors 
are  the  stochastic  loss  of  the  second  NF2  allele  (Baser 
et  al.  1996)  and  putative  modifying  genes  (Bruder  et  al. 
1 999a,  1 999b). 

Constitutional  NF2  mutations  that  are  not  found  by 
SSCP  could  be  mutations  in  the  3'  or  5'  UTRs,  the  pro¬ 
moter  region,  or  untranscribed  transcriptional  control 
elements;  intronic  mutations  that  are  not  covered  by 
SSCP  primers;  large  deletions,  insertions,  or  other  re¬ 
arrangements;  or  mutations  in  patients  who  are  somatic 
mosaics  (Zucman-Rossi  et  al.  1998).  Other  epigenetic 
events  (i.e.,  methylation)  could  result  in  loss  of  NF2 
expression  (Kino  et  al.  2001).  There  is  no  evidence  for 
locus  heterogeneity  in  NF2  (Narod  et  al.  1992).  In  60 
United  Kingdom  families  that  have  people  with  NF2  in 
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two  or  more  generations,  all  families  have  linkage  to 
NF2,  and  NF2  mutations  have  been  identified  in  all  but 
six  families  (D.G.R.E.,  unpublished  data). 

Approximately  20%  of  NF2  new  mutations  are 
thought  to  be  somatic  mosaics,  and  almost  all  have 
mild  disease,  despite  having  constitutional  nonsense 
or  frameshift  NF2  mutations  (Evans  et  al.  19986;  Kluwe 
and  Mautner  1998).  We  excluded  known  somatic  mo¬ 
saics  from  the  analysis,  and,  although  some  somatic 
mosaics  probably  were  not  detected,  the  resultant  bias 
is  likely  to  be  minor.  Among  the  187  people  with  new 
mutations  who  underwent  molecular  screening,  path¬ 
ogenic  NF2  mutations  were  not  identified  in  88.  Sixty- 
eight  of  these  people  with  new  mutations  were  ^20 
years  of  age  at  the  onset  of  symptoms,  or  they  had  two 
or  more  meningiomas  or  four  or  more  spinal  tumors. 
These  people  are  unlikely  to  be  mosaic,  because  they 
have  severe  disease.  The  remaining  20  people  with  new 
mutations  had  mild  disease.  If  20%  of  them  were  mo¬ 
saic,  then  we  failed  to  identify  only  four  mosaic  indi¬ 
viduals  in  the  screened  group*. 

In  summary,  the  strongest  single  predictor  of  the  risk 
of  mortality  in  NF2  is  age  at  diagnosis,  which  is  a  useful 
index  for  patient  counseling  and  clinical  management, 
as  is  meningioma  status.  People  with  constitutional  NF2 
missense  mutations  appear  to  have  a  much  lower  risk 
of  mortality  than  do  those  with  other  types  of  mu¬ 
tations.  Our  finding  that  people  with  NF2  who  are 
treated  at  specialty  centers  have  a  lower  risk  of  mor¬ 
tality  is  consistent  with  studies  of  unilateral  sporadic 
VS,  in  which  surgical  outcomes  were  improved  and  op¬ 
erative  complications  were  reduced  in  proportion  to  in¬ 
creasing  surgical  experience.  To  ensure  optimal  care,  we 
recommend  that  people  with  NF2  be  referred  to  spe¬ 
cialty  treatment  centers. 
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Evaluation  of  clinical  diagnostic  criteria 
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Abstract — Background:  Four  sets  of  clinical  diagnostic  criteria  for  neurofibromatosis  2  (NF2)  have  been  developed  by 
groups  of  expert  clinicians,  but  sensitivity  has  never  been  formally  assessed.  The  sets  of  criteria  differ  for  people  without 
bilateral  vestibular  schwannomas,  which  are  pathognomonic  for  NF2.  Objective:  To  empirically  evaluate  the  four  existing 
sets  of  clinical  diagnostic  criteria  for  NF2.  Methods:  The  study  was  based  on  163  of  403  people  in  the  United  Kingdom  NF2 
registry  (41%)  who  presented  without  bilateral  vestibular  schwannomas.  The  authors  applied  the  sets  of  criteria  to  each 
person  at  initial  assessment  and  at  the  most  recent  clinical  evaluation  (mean  ±  SE  length  of  follow-up,  13  ±  1  years). 
Results:  In  people  with  “definite  NF2”  and  a  negative  family  history  of  NF2,  the  1987  US  NIH  and  1991  NIH  criteria  each 
identify  78%  of  people  at  the  most  recent  clinical  evaluation  but  0%  at  initial  assessment.  The  National  Neurofibromatosis 
Foundation  (NNFF)  criteria  and  the  Manchester  criteria  each  identify  higher  proportions  at  both  time  points  (NNFF 
criteria,  91%  and  10%;  Manchester  criteria,  93%  and  14%),  but  the  proportions  at  initial  assessment  are  still  low. 
Conclusions:  None  of  the  existing  sets  of  criteria  are  adequate  at  initial  assessment  for  diagnosing  people  who  present 
without  bilateral  vestibular  schwannomas  as  having  NF2,  particularly  people  with  a  negative  family  history  of  NF2.  The 
authors  recommend  that  a  single,  revised  set  of  diagnostic  criteria  be  devised  to  replace  all  of  the  existing  sets  of  criteria. 
NEUROLOGY  2002;59:1759-1765 


Neurofibromatosis  2  (NF2)  is  an  autosomal  domi¬ 
nant  disease,  caused  by  inactivating  mutations  of  the 
NF2  tumor  suppressor  gene,1-2  with  an  incidence  of  1 
in  33,000  to  40,000  live  births.3  Clinical  features  of 
NF2  typically  include  nervous  system  tumors  (ves¬ 
tibular  schwannomas,  intracranial  meningiomas, 
spinal  tumors,  and  peripheral  nervous  system  tu¬ 
mors),  and  ocular  abnormalities.4*6  In  1987,  a  US 
NIH  Consensus  Conference  established  clinical  diag¬ 
nostic  guidelines  to  differentiate  NF2  from  NF1.7  Al¬ 
though  the  two  diseases  were  then  known  to  be 
genetically  distinct,8  their  frequent  confusion  in  the 
medical  literature  and  the  need  to  avoid  misdiagno¬ 
sis  in  linkage  studies  prompted  the  development  of 
these  criteria. 

As  the  full  clinical  spectrum  of  NF2  became  better 
defined  by  molecular  analysis  and  neuroimaging,  the 
1987  NIH  criteria  proved  to  be  too  restrictive  for  use 
in  routine  diagnosis.  Revisions  to  the  criteria  were 
recommended  by  a  second  NIH  Consensus  Confer¬ 
ence  in  1991, 9  the  Manchester  group  in  1992, 4  and  a 
group  organized  by  the  National  Neurofibromatosis 
Foundation  (NNFF)  in  1997.10  Each  set  of  criteria 
was  developed  by  a  group  of  expert  clinicians,  but 
the  sensitivity,  specificity,  and  clinical  utility  of 
these  diagnostic  guidelines  have  never  been  formally 
assessed. 

All  four  sets  of  criteria  diagnose  NF2  in  people 


with  bilateral  vestibular  schwannomas  and  in  people 
with  a  first-degree  relative  with  NF2  and  either  a 
unilateral  vestibular  schwannoma  at  less  than  30 
years  of  age  or  at  least  two  other  characteristic  dis¬ 
ease  features  of  NF2  (meningioma,  schwannoma,  gli¬ 
oma,  presenile  cataract).  The  diagnostic  systems 
differ  in  other  respects  (table  1).  Half  of  all  people 
with  NF2  have  a  family  history  of  the  disease.4 

The  purpose  of  this  study  was  to  examine  the  diag¬ 
nostic  efficiency  of  the  four  sets  of  criteria  in  people 
who  do  not  have  bilateral  vestibular  schwannomas  but 
who  do  have  other  signs  of  NF2.  We  found  that  none  of 
the  existing  sets  of  criteria  is  satisfactory  for  diagnos¬ 
ing  such  people  at  initial  assessment.  Overall  and  at 
the  time  of  the  most  recent  clinical  evaluation,  the 
Manchester  criteria  are  the  most  sensitive. 

Methods.  The  study  population  was  selected  from  the 
United  Kingdom  NF2  registry.  NF2  patients  are  ascer¬ 
tained  by  contacting  neurosurgeons,  otolaryngologists, 
neurologists,  pediatricians,  dermatologists,  and  geneticists 
throughout  the  UK.  Patients  are  also  identified  through 
the  Regional  Cancer  Registry  in  the  North  West  Region. 
As  of  1  April  2002,  the  registry  had  clinical  and  molecular 
information  on  427  people  with  proven  or  suspected  NF2 
from  282  families.  We  excluded  asymptomatic  at-risk 
members  of  NF2  families  who  were  diagnosed  through  ge¬ 
netic  screening  and  did  not  have  clinical  information  (n  = 
13)  and  11  other  people  with  insufficient  clinical  informa- 
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Table  1  Clinical  diagnostic  criteria  for  NF2 

1987  NIH  criteria 

A.  Bilateral  vestibular  schwannomas 

B.  First-degree  family  relative  with  NF2  and  unilateral 
vestibular  schwannoma  or  any  two  of:  meningioma, 
schwannoma,  glioma,  neurofibroma,  juvenile  posterior 
subcapsular  lenticular  opacity 

1991  NIH  criteria 

A.  Bilateral  vestibular  schwannomas 

B.  First-degree  family  relative  with  NF2  and  unilateral 
vestibular  schwannoma  or  any  one  of:  meningioma, 
schwannoma,  glioma,  neurofibroma,  juvenile  posterior 
subcapsular  lens  opacity 

Manchester  criteria* 

A.  Bilateral  vestibular  schwannomas 

B.  First-degree  family  relative  with  NF2  and  unilateral 
vestibular  schwannoma  or  any  two  of:  meningioma, 
schwannoma,  glioma,  neurofibroma,  posterior  subcapsular 
lenticular  opacities 

C.  Unilateral  vestibular  schwannoma  and  any  two  of: 
meningioma,  schwannoma,  glioma,  neurofibroma,  posterior 
subcapsular  lenticular  opacities 

D.  Multiple  meningiomas  (two  or  more)  and  unilateral  vestibular 
schwannoma  or  any  two  of:  schwannoma,  glioma, 
neurofibroma,  cataract 

NNFF  criteriat 

A.  Confirmed  or  definite  NF2 

1.  Bilateral  vestibular  schwannomas 

2.  First-degree  family  relative  with  NF2  tind  unilateral 
vestibular  schwannoma  at  less  than  30  y  of  age  or  any  two  of: 
meningioma,  schwannoma,  glioma,  juvenile  lens  opacity 
(posterior  subcapsular  cataract  or  cortical  cataract) 

B.  Presumptive  or  probable  NF2 

1.  Unilateral  vestibular  schwannoma  at  less  than  30  y  of  age 
and  at  least  one  of:  meningioma,  schwannoma,  glioma, 
juvenile  lens  opacity  (posterior  subcapsular  cataract  or  cortical 
cataract) 

2.  Multiple  meningiomas  (two  or  more)  and  unilateral  vestibular 
schwannoma  at  less  than  30  y  of  age  or  at  least  one  of: 
schwannoma,  glioma,  juvenile  lens  opacity  (posterior 
subcapsular  cataract  or  cortical  cataract) 

*  In  the  Manchester  criteria,  “any  two  of*  refers  to  two  individual 
tumors  or  cataract,  whereas  in  the  other  sets  of  criteria,  it  re¬ 
fers  to  two  tumor  types  or  cataract. 

t  For  the  purposes  of  this  study,  the  NNFF  criteria  for  confirmed 
or  definite  NF2  and  for  presumptive  or  probable  criteria  were 
considered  to  be  equivalent. 

NF2  =  neurofibromatosis  2;  NNFF  =  National  Neurofibromato¬ 
sis  Foundation. 


tion  for  this  study.  Of  the  remaining  403  people,  240  (59%) 
had  bilateral  vestibular  schwannomas  at  initial 
assessment. 

This  study  was  based  on  the  163  people  (41%)  who  did 
not  have  bilateral  vestibular  schwannomas  at  initial  as¬ 
sessment  (108  new  mutations  and  55  inherited  cases;  138 
NF2  families).  Of  these  163  people,  64  had  left  and  right 
vestibular  schwannomas  diagnosed  at  the  same  time  but 
previously  presented  with  other  NF2-related  abnormalities 


(meningioma,  schwannoma,  glioma,  neurofibroma,  prese- 
nile  cataract),  42  had  only  a  unilateral  vestibular  schwan¬ 
noma  during  follow-up,  40  presented  with  a  unilateral 
vestibular  schwannoma  but  developed  a  contralateral  ves¬ 
tibular  schwannoma  during  follow-up,  and  17  did  not  have 
any  vestibular  schwannomas  during  follow-up.  Of  the  104 
people  who  had  bilateral  vestibular  schwannomas  by  the 
end  of  follow-up,  left  and  right  vestibular  schwannomas 
were  diagnosed  at  the  same  time  in  64  people,  but  there 
was  a  delay  of  1  to  5  years  in  22  people,  6  to  10  years  in  12 
people,  11  to  15  years  in  4  people,  and  more  than  15  years 
in  2  people. 

The  NNFF  criteria  have  separate  categories  for  con¬ 
firmed  or  definite  NF2  and  presumptive  or  probable  NF2 
(see  table  1).  For  the  purposes  of  this  study,  these  two 
categories  were  considered  to  be  equivalent.  In  new  muta¬ 
tions  from  the  United  Kingdom  NF2  registry  that  were 
screened  for  constitutional  NF2  mutations,  mutations 
were  found  in  57%  of  21  people  who  met  the  NNFF  criteria 
for  presumptive  or  probable  NF2,  a  fraction  similar  to  the 
55%  of  168  people  with  bilateral  vestibular  schwannomas. 

At  the  end  of  follow-up,  59  people  had  a  first-degree 
relative  with  NF2.  Seventeen  of  these  people  did  not  have 
an  affected  first-degree  relative  at  initial  assessment.  The 
affected  first-degree  relative  was  a  parent  in  53  patients 
and  an  offspring  in  6  patients.  In  only  one  instance  was 
the  parent  initially  assessed  after  the  offspring  was  diag¬ 
nosed  with  NF2.  An  additional  six  people  died  before  their 
first-degree  relative  was  diagnosed  with  NF2.  For  the  pur¬ 
poses  of  this  study,  these  people  were  classified  as  not 
having  an  affected  relative  during  follow-up,  but  this  infor¬ 
mation  was  considered  in  determining  inheritance. 

The  163  people  who  did  not  have  bilateral  vestibular 
schwannomas  at  initial  assessment  were  divided  into  two 
groups.  People  with  “definite  NF2”  were  those  who  met  the 
specific  criteria,  by  the  end  of  follow-up,  that  all  four  sets 
of  clinical  diagnostic  criteria  have  in  common.  In  each  set 
of  criteria,  persons  are  diagnosed  with  NF2  if  they  have 
bilateral  vestibular  schwannomas  or  a  first-degree  relative 
with  NF2  and  either  a  unilateral  vestibular  schwannoma 
at  less  than  30  years  of  age  or  two  other  characteristic  NF2 
disease  feature  types  (meningioma,  schwannoma,  glioma, 
presenile  cataract).  Identified  constitutional  NF2  muta¬ 
tions  were  also  used  to  define  “definite  NF2.”  One  hundred 
forty-one  people  were  classified  as  having  “definite  NF2”: 
104  had  bilateral  vestibular  schwannomas  by  the  end  of 
follow-up,  98  had  identified  constitutional  NF2  mutations,  8 
had  a  first-degree  relative  with  NF2  and  a  unilateral  vestib¬ 
ular  schwannoma  at  less  than  30  years  of  age,  and  6  had  an 
affected  first-degree  relative,  no  vestibular  schwannomas, 
but  at  least  two  other  characteristic  types  of  NF2  disease 
features.  People  with  “possible  NF2”  (n  =  22)  had  some 
characteristic  features  of  NF2  but  did  not  meet  the  criteria 
for  “definite  NF2.”  The  length  of  follow-up  was  not  signifi¬ 
cantly  different  between  people  with  “definite  NF2”  and 
those  with  “possible  NF2”  (mean  ±  SE,  13  ±  1  and  11  ±  2 
years). 

The  four  sets  of  diagnostic  criteria  were  applied  to  these 
two  groups  to  determine  the  proportion  of  people  who  met 
each  set  of  criteria.  We  evaluated  the  diagnostic  criteria  at 
two  time  points:  initial  assessment  and  most  recent  clini¬ 
cal  evaluation.  The  United  Kingdom  NF2  registry  has  the 
age  at  diagnosis  of  each  tumor  type  and  the  age  at  which 
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Table  2  Four  sets  of  NF2  clinical  diagnostic  criteria  applied  to  163  people  in  the  United  Kingdom  NF2  registry  who  presented  without 
bilateral  vestibular  schwannomas,  by  time  of  evaluation,  disease  group,  and  family  history 


No.  (%)  of  patients  identified  by  time  of  evaluation,  disease  group,t  and  family  history^ 


Initial  assessment  Most  recent  clinical  evaluation§ 


“Definite  NF2”  “Definite  NF2" 


family  history  family  history 


Diagnostic  criteria* 

Negative, 
n  =  99 

Positive, 
n  =  42 

“Possible  NF2,” 
n  =  22 

Negative, 
n  =  85 

Positive, 
n  =  56 

“Possible  NF2,’ 
n  —  22 

1987  NIH 

0(0) 

19  (45) 

0(0) 

66  (78) 

55  (98) 

0(0) 

1991 NIH 

0(0) 

42 (100) 

0(0) 

66  (78) 

56  (100) 

3(14) 

NNFF 

10  (10) 

17  (40) 

5(23) 

77  (91) 

54  (96) 

10  (46) 

Manchester 

14  (14) 

29  (69) 

7(32) 

79  (93) 

55  (98) 

22  (100) 

*  See  table  1  for  definition  of  the  different  sets  of  diagnostic  criteria. 

t  Seventeen  first-degree  relatives  of  people  with  “definite  NF2”  were  diagnosed  after  initial  assessment  (14  “definite  NF2W  patients  and 
3  “possible  NF2*  patients). 

$  “Definite  NF2”:  people  who  had  bilateral  vestibular  schwannomas  by  the  end  of  follow-up  or  who  had  a  first-degree  relative  with  NF2 
and  either  a  unilateral  vestibular  schwannoma  at  less  than  30  y  of  age  or  at  least  two  other  characteristic  disease  feature  types  of 
NF2  or  who  had  an  identified  constitutional  NF2  mutation.  “Possible  NF2”:  people  who  had  some  characteristic  disease  features  of 
NF2  but  who  did  not  meet  the  criteria  for  “definite  NF2  ” 

§  The  mean  ±  SE  period  of  follow-up  was  13  ±  1  years. 

NF2  =  neurofibromatosis  2;  NNFF  =  National  Neurofibromatosis  Foundation. 


first-degree  relatives  were  diagnosed,  and  for  the  analysis 
based  on  initial  assessment,  we  considered  only  those  ab¬ 
normalities  and  affected  relatives  that  existed  at  that  time 
point. 

Using  Kaplan-Meier  analysis,  we  determined  the  time 
course,  from  initial  assessment  to  the  most  recent  clinical 
evaluation,  of  the  increasing  proportion  of  people  who 
would  be  diagnosed  with  NF2  using  the  different  sets  of 
criteria  as  a  result  of  new  disease  features  developing  or  a 
first-degree  relative  being  newly  diagnosed.  Only  those 
manifestations  and  affected  first-degree  relatives  that 
were  present  in  each  year  of  follow-up  were  considered. 
Because  the  Kaplan-Meier  curves  for  the  different  sets  of 
diagnostic  criteria  are  based  on  the  same  people,  we  used 
the  jackknife  method,11  with  family  as  the  unit,  to  compute 
pointwise  standard  errors  for  differences  in  proportions  of 
pairwise  Kaplan-Meier  curves. 

Results.  Table  2  presents  the  proportion  of  the  163  peo¬ 
ple  who  presented  without  bilateral  vestibular  schwanno¬ 
mas  at  initial  assessment  that  were  identified  using  each 
set  of  diagnostic  criteria.  At  initial  assessment,  the  1987 
NIH  criteria  and  the  1991  NIH  criteria  each  identified  0% 
of  people  with  “definite  NF2”  and  a  negative  family  history 
of  NF2,  whereas  the  NNFF  criteria  and  the  Manchester 
criteria  identified  greater  but  still  low  proportions  (10% 
and  14%).  The  1991  NIH  criteria  identified  100%  of  people 
with  “definite  NF2”  and  a  positive  family  history  of  NF2, 
whereas  the  other  sets  of  criteria  identified  45%  to  69%. 
The  1987  NIH  criteria  and  the  1991  NIH  criteria  did  not 
identify  any  people  with  “possible  NF2”  as  having  NF2, 
whereas  the  NNFF  criteria  and  the  Manchester  criteria 
identified  greater  but  still  low  proportions  (23%  and  32%). 

As  expected,  greater  proportions  of  people  were  diag¬ 
nosed  with  NF2  using  each  set  of  criteria  at  the  most 


recent  clinical  evaluation  than  at  initial  assessment  (i.e., 
after  an  average  of  13  years  from  initial  assessment).  At 
the  most  recent  clinical  evaluation,  the  1987  NIH  criteria 
and  the  1991  NIH  criteria  each  identified  78%  of  people 
with  “definite  NF2”  and  a  negative  family  history  of  NF2. 
The  NNFF  and  Manchester  criteria  identified  NF2  in 
higher  proportions  of  these  people  (91%  and  93%).  All  four 
sets  of  criteria  identified  high  proportions  of  people  with 
“definite  NF2”  and  a  positive  family  history  of  NF2  (96% 
to  100%).  The  Manchester  criteria  identified  NF2  in  100% 
of  people  with  “possible  NF2  ”  but  the  other  sets  of  criteria 
identified  NF2  in  much  lower  proportions  (0%  to  46%). 

Figure  1  illustrates  the  proportion  of  the  121  people  in 
this  study  with  a  negative  family  history  of  NF2  at  initial 
assessment  that  met  each  of  the  four  sets  of  diagnostic 
criteria  with  increasing  length  of  time  from  initial  assess¬ 
ment,  using  Kaplan-Meier  analysis.  At  1  year  from  initial 
assessment,  the  proportion  of  people  that  met  the  diagnos¬ 
tic  criteria  for  NF2  was  27%  using  the  Manchester  criteria, 
23%  using  the  NNFF  criteria,  5%  using  the  1991  NIH 
criteria,  and  4%  using  the  1987  NIH  criteria.  At  5  years 
from  initial  assessment,  these  proportions  were  49%  using 
the  Manchester  criteria,  44%  using  the  NNFF  criteria, 
27%  using  the  1991  NIH  criteria,  and  24%  using  the  1987 
NIH  criteria. 

Table  3  presents  the  differences  in  proportions  of  people 
identified  as  having  NF2  in  comparisons  between  the  dif¬ 
ferent  sets  of  diagnostic  criteria  (pointwise  comparisons, 
corresponding  to  jump  points  in  the  Kaplan-Meier  curves). 
For  the  Manchester  criteria  and  the  1991  NIH  criteria,  the 
difference  in  proportions  was  significantly  greater  than 
zero  from  initial  assessment  to  25  years  after  initial  as¬ 
sessment.  For  the  NNFF  criteria  and  the  1991  NIH  crite¬ 
ria,  the  difference  in  proportions  was  significantly  greater 
than  zero  from  initial  assessment  to  15  years  after  initial 
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Time  since  initial  clinical  evaluation  (years) 

Figure  1.  Proportion  of  121  people  with  a  negative  family 
history  of  neurofibromatosis  2  (NF2)  at  initial  assessment 
that  meet  the  four  sets  of  clinical  diagnostic  criteria  with 
increasing  length  of  time  from  initial  assessment  (Kaplan - 
Meier  analysis).  Solid  line  —  Manchester  criteria;  dashed 
line  —  National  Neurofibromatosis  Foundation  criteria; 
dash- dot  line  —  1991  N1H  criteria;  dotted  line  —  1987 
NIH  criteria. 


assessment.  For  the  Manchester  criteria  and  the  NNFF 
criteria,  the  difference  in  proportions  was  significantly 
greater  than  zero  at  initial  assessment  and  from  5  to  20 
years  after  initial  assessment. 

There  were  8  known  somatic  mosaics  (determined 
through  molecular  analysis)  among  the  98  new  mutations 
that  were  screened  for  constitutional  NF2  mutations.  At 
initial  assessment,  only  the  Manchester  criteria  identified 
any  of  the  mosaics  (one)  as  having  NF2.  At  the  most  recent 
clinical  evaluation,  the  1987  NIH  criteria  and  1991  NIH 
criteria  each  identified  four  mosaics  as  having  NF2,  the 
NNFF  criteria  identified  five  mosaics,  and  the  Manchester 
criteria  identified  all  eight  mosaics.  The  median  age  at 
onset  of  symptoms  in  the  eight  mosaics  was  28  years 
(range  23  to  46  years)  and  the  median  age  at  diagnosis  was 
43  years  (range  27  to  62  years).  At  initial  assessment,  four 
of  the  mosaics  had  unilateral  vestibular  schwannomas,  two 
had  peripheral  nervous  system  schwannomas,  one  had  spi¬ 
nal  tumors,  and  one  had  cataracts.  At  the  most  recent 
clinical  evaluation,  four  of  the  mosaics  had  unilateral  ves¬ 
tibular  schwannomas,  four  had  bilateral  vestibular  schw¬ 
annomas,  four  had  spinal  tumors,  two  had  peripheral 
nervous  system  schwannomas,  two  had  intracranial  me¬ 
ningiomas,  two  had  cataracts,  and  one  had  a  non-VIII 
nerve  cranial  nerve  tumor. 

Discussion.  Bilateral  vestibular  schwannomas  are 
pathognomic  for  NF2,  but  163  of  the  403  people  with 
clinical  information  in  the  United  Kingdom  NF2  reg¬ 
istry  (41%)  did  not  have  bilateral  vestibular  schwan¬ 
nomas  at  initial  assessment.  The  four  sets  of  clinical 
diagnostic  criteria  identify  different  proportions  of 
these  163  people  as  having  NF2,  in  both  “definite 
NF2”  and  “possible  NF2”  patients.  People  with  a 


negative  family  history  of  NF2  at  initial  assessment 
present  the  greatest  diagnostic  difficulties.  By 
Kaplan-Meier  analysis,  the  Manchester  criteria  and 
the  NNFF  criteria  each  identify  significantly  higher 
proportions  of  these  people  than  the  1991  NIH  crite¬ 
ria,  but  the  Manchester  criteria  usually  identify  a 
significantly  higher  proportion  than  the  NNFF 
criteria. 

The  1991  NIH  criteria  identify  the  highest  propor¬ 
tion  of  people  with  “definite  NF2”  and  a  positive 
family  history  of  NF2  because  this  set  of  criteria 
requires  only  one  instead  of  two  characteristic  dis¬ 
ease  features  to  establish  the  diagnosis  in  people 
with  a  family  history  of  the  disease.  However,  the 
1987  NIH  criteria  and  the  1991  NIH  criteria  each 
require  a  positive  family  history  to  diagnose  NF2  in 
people  who  do  not  have  bilateral  vestibular  schwan¬ 
nomas.  As  a  result,  the  NIH  criteria  identify  the 
lowest  proportion  of  people  with  “definite  NF2”  and  a 
negative  family  history  of  NF2  and  few  people  with 
“possible  NF2.”  The  three  people  with  “possible  NF2” 
who  were  identified  at  the  most  recent  clinical  eval¬ 
uation  using  the  1991  NTH  criteria  each  had  a  family 
member  who  had  been  diagnosed  with  NF2  after 
initial  assessment,  no  vestibular  schwannomas,  or  a 
unilateral  vestibular  schwannoma  at  age  30  or  older 
and  one  other  characteristic  disease  feature  of  NF2. 

Unlike  the  NIH  criteria,  a  diagnosis  of  NF2  can  be 
made  using  the  Manchester  criteria  and  the  NNFF 
criteria  in  people  who  do  not  have  bilateral  vestibu¬ 
lar  schwannomas  or  a  family  history  of  NF2  but  who 
do  have  other  characteristic  disease  features.  The 
Manchester  criteria  and  the  NNFF  criteria  have  sim¬ 
ilar  wording,  but  the  Manchester  criteria  identify  a 
higher  proportion  of  people  than  the  NNFF  criteria 
because  the  Manchester  criteria  are  based  on  the 
number  of  disease  features  of  any  type  (i.e.,  the  num¬ 
ber  of  individual  tumors),  whereas  the  NNFF  criteria 
and  the  NIH  criteria  are  based  on  the  number  of 
different  feature  types.  For  example,  in  a  person 
with  two  intracranial  meningiomas,  the  Manchester 
criteria  count  the  two  meningiomas  as  two  disease 
features,  whereas  the  NNFF  criteria  and  the  NIH 
criteria  count  the  two  meningiomas  as  a  single  dis¬ 
ease  feature  because  they  are  the  same  histologic 
type  of  tumor. 

The  Manchester  criteria  also  differ  from  the 
NNFF  criteria  because  the  Manchester  criteria  do 
not  require  an  age  at  diagnosis  of  unilateral  vestibu¬ 
lar  schwannoma  of  less  than  30  years.  This  age  re¬ 
quirement  decreases  sensitivity  and  is  not  needed  to 
increase  specificity  because  people  with  unilateral 
vestibular  schwannoma  who  do  not  have  a  family 
history  of  NF2  or  any  other  characteristic  disease 
features  of  NF2  have  a  very  low  probability  of  devel¬ 
oping  NF2.  The  probability  decreases  from  1%  in 
people  who  are  diagnosed  with  a  unilateral  vestibu¬ 
lar  schwannoma  at  ages  10  to  19  years  to  0.45%  in 
those  aged  20  to  29  years  to  0.15%  in  those  aged  30 
to  39  years.12  Figure  2  presents  data  from  the  UK 
showing  that  54%  of  NF2  patients  with  new  muta- 
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Table  3  Pointwise  comparisons  of  Kaplan-Meier  curves  in  figure  1,  difference  in  proportions  of  people  identified  as  having  NF2  using 
different  sets  of  diagnostic  criteria ,  by  time  since  initial  assessment 


Comparisons  between  sets  of  diagnostic  criteria 


Time  since  initial 
assessment,  y 

Manchester  v  1991  NIH 

NNFF  v  1991  NIH 

Manchester  v  NNFF 

Difference  in 
proportions 

95%  Cl 

Difference  in 
proportions 

95%  Cl 

Difference  in 
proportions 

95%  Cl 

0 

0.174 

0.107-0.241 

0.124 

0.065-0.183 

0.050 

0.005-0.095 

1 

0.215 

0.141-0.289 

0.174 

0.101-0.247 

0.040 

0.000-0.089 

2 

0.230 

0.154-0.306 

0.190 

0.114-0.266 

0.039 

0.000-0.086 

3 

0.236 

0.160-0.312 

0.197 

0.121-0.273 

0.039 

0.000-0.086 

4 

0.240 

0.158-0.322 

0.193 

0.113-0.273 

0.046 

0.000-0.093 

5 

0.233 

0.139-0.307 

0.170 

0.088-0.252 

0.053 

0.010-0.096 

6 

0.234 

0.148-0.320 

0.175 

0.093-0.257 

0.060 

0.015-0.105 

7 

0.228 

0.144-0.312 

0.170 

0.088-0.252 

0.058 

0.015-0.101 

8 

0.191 

0.113-0.269 

0.126 

0.048-0.204 

0.065 

0.020-0.110 

9 

0.196 

0.125-0.270 

0.133 

0.059-0.207 

0.063 

0.020-0.106 

10 

0.195 

0.119-0.271 

0.118 

0.045-0.191 

0.078 

0.031-0.125 

11 

0.221 

0.141-0.301 

0.148 

0.070-0.226 

0.074 

0.029-0.119 

12 

0.199 

0.123-0.275 

0.124 

0.050-0.198 

0.075 

0.030-0.120 

13 

0.177 

0.103-0.251 

0.116 

0.043-0.189 

0.061 

0.014-0.118 

14 

0.158 

0.087-0.229 

0.100 

0.031-0.169 

0.059 

0.012-0.116 

15 

0.151 

0.080-0.222 

0.085 

0.020-0.150 

0.066 

0.023-0.107 

16 

0.140 

0.064-0.216 

0.069 

0.000-0.107 

0.071 

0.020-0.122 

17 

0.144 

0.066-0.222 

0.077 

0.001-0.153 

0.067 

0.016-0.118 

18 

0.135 

0.064-0.206 

0.073 

0.004-0.142 

0.061 

0.012-0.110 

19 

0.123 

0.056-0.192 

0.069 

0.000-0.138 

0.054 

0.070-0.101 

20 

0.118 

0.049-0.187 

0.067 

0.000-0.136 

0.050 

0.003-0.097 

21 

0.111 

0.042-0.180 

0.067 

0.000-0.136 

0.044 

0.000-0.089 

22 

0.095 

0.030-0.160 

0.051 

0.000-0.116 

0.044 

0.000-0.089 

23 

0.062 

0.009-0.115 

0.018 

0.000-0.073 

0.044 

0.000-0.089 

24 

0.058 

0.003-0.113 

0.034 

0.000-0.097 

0.024 

0.000-0.061 

25 

0.060 

0.009-0.111 

0.049 

0.000-0.104 

0.011 

0.000-0.042 

30 

0.052 

0.000-0.148 

0.017 

0.000-0.117 

0.035 

0.000-0.074 

Differences  in  proportions  and  95%  Cl  are  constant  in  the  interval  between  25  and  30  y  after  initial  assessment. 
NNFF  =  National  Neurofibromatosis  Foundation. 


tions  are  diagnosed  with  their  first  vestibular 
schwannoma  at  less  than  30  years  old  compared 
with  only  5%  of  people  with  unilateral  vestibular 
schwannomas,  but  conversely  46%  of  NF2  patients 
with  new  mutations  are  diagnosed  with  their  first 
vestibular  schwannoma  at  30  years  and  older. 
Screening  for  constitutional  NF2  mutations  in  the 
small  fraction  of  people  with  unilateral  vestibular 
schwannoma  who  present  at  younger  than  30  years 
can  identify  people  with  NF2. 12-14 

In  the  current  study,  the  NNFF  age  requirement 
reduced  sensitivity.  Solely  because  of  the  age  re¬ 
quirement,  14  people  who  were  diagnosed  with  uni¬ 
lateral  vestibular  schwannomas  at  30  years  or  older 
were  not  identified  by  the  NNFF  criteria  but  were 


identified  by  the  Manchester  criteria.  There  were  4 
people  with  “definite  NF2  ”  including  3  somatic  mosa¬ 
ics,  and  10  people  with  “possible  NF2.”  In  most  NF2 
somatic  mosaics,  disease  manifestations  are  milder 
and  vestibular  schwannomas  are  of  later  onset  than  in 
people  with  classic  NF2.1517  The  odds  of  somatic  mosa¬ 
icism  in  NF2  increase  11-fold  per  decade  increase  in 
age  at  diagnosis  of  NF2  and  are  7-fold  greater  in  NF2 
patients  with  no  vestibular  schwannomas  or  a  unilat¬ 
eral  vestibular  schwannoma  than  in  those  with  bilat¬ 
eral  vestibular  schwannomas.17 

A  “gold  standard”  does  not  exist  for  distinguishing 
normal  individuals  from  those  who  actually  have 
NF2  in  every  case.  The  current  study  focuses  on 
sensitivity,  but  there  is  considerable  evidence  that 
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Age  at  diagnosis  of  vestibular  schwannoma  (years) 

Figure  2.  Distribution  of  age  at  diagnosis  of  481  people 
with  unilateral  vestibular  schwannoma  in  the  St  Mary's 
Hospital  geographic  catchment  area  (Manchester ,  UK)  and 
age  at  first  vestibular  schwannoma  of  127  NF2  patients 
with  new  mutations  in  the  United  Kingdom  NF2  registry 
(time  period  for  both  groups,  1987  to  1997).  People  with 
known  NF2  are  excluded  from  the  group  with  unilateral 
vestibular  schwannoma,  and  known  NF2  somatic  mosaics 
(defined  through  molecular  testing)  are  excluded  from  the 
NF2  group  with  new  mutations.  The  median  age  at  diag¬ 
nosis  for  people  with  unilateral  vestibular  schwannoma  is 
53  years,  and  the  median  age  at  diagnosis  of  first  vestibu¬ 
lar  schwannoma  for  NF2  patients  with  new  mutations  is 
29  years .  Solid  columns  =  first  NF2  vestibular  schwan¬ 
noma;  cross-hatched  columns  ~  unilateral  vestibular 
schwannoma. 


specificity  is  high  (about  99%)  and  similar  for  each 
set  of  diagnostic  criteria.12 14  As  noted  above,  people 
with  unilateral  vestibular  schwannoma  who  do  not 
have  a  family  history  of  NF2  or  characteristic  dis¬ 
ease  features  of  NF2  have  a  very  low  probability  of 
developing  NF2,  and  this  probability  decreases  with 
increasing  age  at  diagnosis  of  vestibular  schwanno¬ 
ma.1344  Sporadic  unilateral  vestibular  schwannoma 
can  occur  in  two  first-degree  relatives  by  chance, 
giving  the  appearance  of  heritable  vestibular  schw¬ 
annomas.  Sporadic  unilateral  vestibular  schwan¬ 
noma  has  a  lifetime  prevalence  of  about  1  in  1,000, 
and  assuming  that  people  have  an  average  of  five 
first-degree  relatives,  the  lifetime  probability  of  spo¬ 
radic  unilateral  vestibular  schwannoma  occurring  by 
chance  in  two  first-degree  relatives  is  0.5%.12 

Two  people  in  this  study  are  of  particular  interest 
because  of  their  late  age  at  diagnosis  of  their  first 
vestibular  schwannoma.  They  were  diagnosed  with 
their  first  vestibular  schwannoma  at  ages  60  and  72, 
and  each  developed  a  contralateral  vestibular 
schwannoma  2  years  later.  Neither  had  a  family  his¬ 
tory  of  NF2  or  any  other  clinical  feature  of  the  dis¬ 
ease.  One  person  had  molecular  testing  and  was 
found  to  be  mosaic  for  a  constitutional  NF2  frame- 
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shift  mutation.  The  other  person  also  might  be  a 
somatic  mosaic  or  a  rare  person  with  two  sporadic 
unilateral  vestibular  schwannomas  who  does  not 
have  NF2  (this  is  most  likely  in  older  people,  in 
whom  the  incidence  of  unilateral  sporadic  vestibular 
schwannoma  is  highest).18  A  third  possibility  is  that 
the  person  might  have  nonmosaic  NF2  with  unusu¬ 
ally  mild  expression,  perhaps  as  a  result  of  a  “weak” 
mutant  allele.  The  two  adult  children  of  this  person 
do  not  have  clinical  features  of  NF2;  in  such  in¬ 
stances,  family  molecular  genetic  studies  may  be 
useful. 

A  more  sensitive  set  of  diagnostic  criteria  for  NF2 
can  be  developed  by  adding  mononeuropathy  as  a 
clinical  diagnostic  criterion  and  incorporating  the  re¬ 
sults  of  genetic  testing.19  In  the  United  Kingdom 
NF2  registry,  mononeuropathy  (foot  drop,  wrist  drop, 
facial  palsy,  or  III  nerve  palsy)  occurs  in  31%  of 
people  with  an  age  at  onset  of  symptoms  of  less  than 
15  years  but  in  only  3%  of  those  with  an  age  at  onset 
of  15  years  or  older.  NF2-associated  mononeuropathy 
has  been  reported  in  previous  studies,20-24  as  has  a 
more  generalized  peripheral  neuropathy.4’5*25  The  dif¬ 
ferential  diagnosis  should  exclude  polio.4  In  general, 
the  presenting  symptom  in  young  people  with  NF2  is 
less  likely  to  be  vestibular  than  in  adults  with 
2^2. 22’26’27 

NF2  is  99%  penetrant  by  age  60, 28  and  genetic 
testing  to  identify  pathogenic  constitutional  NF2 
mutations  or  NF2  mutation  carriers  can  increase 
sensitivity  with  minimal  decrease  in  specificity.19 
Negative  results  from  single-strand  conformational 
polymorphism  (SSCP)  analysis  on  blood  samples,  a 
commonly  used  method,29  cannot  rule  out  NF2  be¬ 
cause  such  testing  would  not  identify  mutations  in 
the  3'  or  5'  untranslated  regions,  the  promoter  re¬ 
gion,  untranscribed  transcriptional  control  elements, 
or  introns  that  are  not  covered  by  conventional  SSCP 
primers;  large  deletions,  insertions,  or  other  rear¬ 
rangements;  or  mutations  in  patients  who  are  so¬ 
matic  mosaics  if  the  mutant  line  is  not  sufficiently 
abundant  in  the  tissue  tested.30  Other  epigenetic 
events  (i.e.,  methylation)  also  could  result  in  loss  of 
NF2  expression.31  Segregation  analysis  using  tightly 
linked  genetic  markers  can  identify  NF2  mutation 
carriers  in  at-risk  members  of  NF2  families.  Segre¬ 
gation  analysis  is  more  rapid  and  less  expensive 
than  mutation  analysis  and,  in  conjunction  with  clin¬ 
ical  information,  often  can  discriminate  between 
NF2  mutation  carriers  and  noncarriers  with  a  high 
degree  of  certainty. 32)33 

We  recommend  that  a  single  revised  set  of  diag¬ 
nostic  criteria  be  devised  to  replace  all  of  the  existing 
diagnostic  systems  for  NF2.  In  future  work,  we  will 
empirically  validate  our  suggestions  for  revised 
criteria. 
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Neurofibromatosis  2  (NF2)  is  an  autosomal  dominant 
disease  that  is  caused  by  inactivating  mutations  of  the 
NF2  tumour  suppressor  gene.1 2  Multiple  central  and 
peripheral  nervous  system  tumours  and  ocular  abnormalities 
are  common  in  NF2;  bilateral  vestibular  schwannomas  are 
pathognomonic  for  the  disease.  Genotype-phenotype  correla¬ 
tions  are  well  established  for  NF2  associated  tumours.  In 
general,  constitutional  nonsense  or  frameshift  NF2  mutations 
are  associated  with  severe  NF2  (that  is,  earlier  onset  of 
symptoms  and  more  tumours),  splice  site  mutations  with 
variable  disease  severity,  and  mis  sense  mutations  with  mild 
disease. 

Genotype-phenotype  correlations  have  not  been  reported 
for  the  non- tumour  manifestations  of  NF2.  The  most 
common  of  these  manifestations  is  presenile  cataracts,  which 
occur  in  about  60-80%  of  people  with  NF2.3-5  In  animal 
models,  lens  fibre  cells  that  are  more  differentiated  express 
less  Nf2  protein  than  the  epithelial  regions  of  the  lens, 
suggesting  that  the  Nf2  protein  may  play  a  role  in  lens 
epithelial  cell  migration  or  elongation.6  The  purpose  of  this 
study  was  to  determine  if  there  were  genotype-phenotype 
correlations  for  cataracts  in  NF2. 

PATIENTS  AND  METHODS 

The  study  was  based  on  the  United  Kingdom  NF2  registry  in 
the  Department  of  Medical  Genetics,  St  Mary's  Hospital, 
Manchester.  NF2  patients  are  ascertained  by  contacting 
neurosurgeons,  otolaryngologists,  neurologists,  paediatri¬ 
cians,  dermatologists,  and  geneticists  throughout  the 
United  Kingdom,  augmented  in  the  North  West  Region  by 
the  Regional  Cancer  Registry.  The  study  was  subject  to 
continuing  ethics  committee  evaluation  and  patients  con¬ 
sented  to  participation.  Patients  were  screened  for  constitu¬ 
tional  NF2  mutations  using  single  strand  conformational 
polymorphism  analysis  (SSCP)  as  previously  described,7  and 
examined  for  cataracts  using  slitlamp  biomicroscopy  at  the 
time  of  diagnosis  of  NF2.  For  this  study,  cataracts  were 
defined  as  present  or  absent  (that  is,  posterior  subcapsular 
cataracts  and  cortical  cataracts  were  aggregated).  There  were 
255  people  from  190  families  (159  people  with  new 
mutations  and  96  inherited  cases;  132  females  and  123 
males)  (table  1). 

For  univariate  analyses,  Fisher's  exact  test  was  used  for 
binary  variables  and  the  two-tailed  t  test  was  used  for 
continuous  variables.  A  multivariate  probit  model  with  an 
exchangeable  correlation  structure  within  families  was  used 
with  various  sets  of  covariates  to  account  for  possible  familial 
dependence.8  From  a  regression  coefficient  p,  an  approximate 
relative  risk  (RR  =  exp{2*  p})  and  confidence  interval  (Cl) 
for  presence  of  cataracts  can  be  calculated.  In  the  probit 
model,  the  reference  group  in  comparisons  between  people 
with  different  types  of  NF2  mutations  was  people  who  had 
constitutional  nonsense  or  frameshift  NF2  mutations. 

There  is  a  potential  bias  towards  a  lower  age  at  onset  of 
symptoms  or  age  at  diagnosis  in  inherited  cases  owing  to  the 
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family  history  of  the  disease.  In  the  study  group  as  a  whole, 
there  were  no  significant  differences  in  these  ages  between 
people  with  new  mutations  and  inherited  cases  for  any  type 
of  NF2  mutation.  Also,  using  a  probit  model,  the  RR  of 
cataracts  was  not  significantly  associated  with  age  at 
diagnosis  (see  below).  Therefore,  for  all  mutation  categories 
except  unfound  mutations,  we  combined  people  with  new 
mutations  and  inherited  cases.  In  the  large  group  of  people 
with  unfound  mutations,  we  retained  the  division  between 
those  with  new  mutations  and  inherited  disease  because 
people  with  new  unfound  mutations  may  be  somatic 
mosaics.  We  used  age  at  onset  of  symptoms  to  categorise 
people  with  new  unfound  mutations  by  disease  severity 
(severe  disease,  onset  of  symptoms  at  ages  <20  years;  mild 
disease,  onset  of  symptoms  at  ages  2*20  years).3 

RESULTS 

As  expected,  the  mean  age  at  onset  of  symptoms  and  age  at 
diagnosis  were  higher  in  people  with  non- truncating  muta¬ 
tions  and  in  somatic  mosaics  than  in  people  with  classical 
NF2  and  nonsense  or  frameshift  mutations  (table  1).  The 
overall  prevalence  of  cataracts  was  33%,  but  the  prevalence  of 
cataracts  was  significantly  lower  in  somatic  mosaics  and  in 
people  with  new  unfound  mutations  and  onset  of  symptoms 
at  ages  ^20  years  than  in  people  with  classical  NF2  and 
nonsense  or  frameshift  mutations.  In  people  with  cataracts, 
29%  were  diagnosed  with  cataracts  at  ages  <10  years,  and 
47%  at  ages  <20  years  (mean  23  (SE  2)  years).  Seventy 
percent  were  diagnosed  with  cataracts  before  their  first  non¬ 
ocular  sign  or  symptom. 

In  the  multivariate  probit  model  summarised  in  table  2,  the 
RR  of  cataracts  did  not  significantly  increase  with  increasing 
age  at  diagnosis,  after  accounting  for  the  type  of  constitutional 
NF2  mutation.  This  was  also  the  case  in  the  other  probit 
models  (data  not  shown).  This  is  probably  due  to  the 
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Table  1  by  type  of  NF2  triufafan  '  7^ 
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t Excludes  15  inherited  cases  who  were  asymptomatic  at  the  time  of  diagnosis  of  NF2. 

Comparisons  to  people  with  classical  NF2  and  nonsense  or  frameshift  mutations  (p  values  are  computed  based  on  on  assumption  of  independence,  which  is 
violated  to  a  slight  degree  awing  to  Families  with  multiple  affected  relatives); 

Somatic  mosaics:  ago  at  onset,  age  at  diagnosis,  current  age.  p<  0,001;  cataracts,  p  =*0.053, 

Splice  site  mutations:  age  at  onset,  p= 0,001;  age  at  diagnosis,  p  =  0.023;  current  age,  p  =  0.043;  intracranial  meningiomas,  p  =  0.024. 


;  f eo|le54i|t  new  mutations  and  age  at  onset  20  years:  cataracts,  p<0,001 ,  *  '  w 

*  $D,  standard  deviation.  (  J"“  *  *  ;  4  n  # 


relatively  young  study  population  (mean  age  at  diagnosis 
28  years  (SE  1);  only  5%  diagnosed  at  ages  >55  years  and 
2%  at  ages  >60  years),  since  the  prevalence  of  posterior 
subcapsular  and  cortical  cataracts  in  people  aged  <55  years 
in  the  general  population  is  very  low.10 

The  RR  of  cataracts  was  less  than  1  in  all  mutation  groups 
as  compared  to  people  with  classical  NF2  and  nonsense  or 
frameshift  mutations.  The  RR  was  significantly  lower  than  1 
in  somatic  mosaics  (RR  =  0.20,  95%  Cl  =  0.10  to  0.40),  in 
people  with  large  deletions  (RR  =  0.39,  95%  Cl  =  0.16  to 
0.98),  and  in  people  with  new  unfound  mutations  and  onset 
of  symptoms  at  ages  s=20  years  (RR  =  0.09,  95%  Cl  =  0.03  to 
0.28).  The  RR  of  cataracts  in  people  with  missense  mutations 
was  lower  than  1  but  not  statistically  significantly  so 
(RR  =  0.38,  95%  Cl  =  0.14  to  1.08). 

DISCUSSION 

The  low  RR  of  cataracts  in  somatic  mosaics,  in  people  with 
large  deletions,  and  in  people  with  new  unfound  mutations 
and  older  onset  of  symptoms  is  consistent  with  the  generally 
mild  disease  in  NF2  patients  with  each  of  these  types  of 
mutations  or  conditions.  The  low  RR  of  cataracts  in  people 


with  new  unfound  mutations  and  older  onset  of  symptoms 
could  be  due  to  several  types  of  mutations  or  conditions  that 
are  unlikely  to  be  identified  by  SSCP,  and  that  are  known  to 
be  associated  or  likely  to  be  associated  with  mild  NF2.  These 
mutations  or  conditions  are  somatic  mosaicism;  large 
deletions,  insertions,  or  other  rearrangements;  mutations  in 
the  3  or  5  untranslated  regions,  the  promoter  region,  or 
untranscribed  transcriptional  control  elements;  intronic 
mutations  that  are  not  covered  by  conventional  SSCP 
primers;  or  other  epigenetic  events  causing  loss  of  NF2 
expression,  such  as  methylation. 

Somatic  mosaicism  and  large  deletions  are  the  most  likely 
of  these  possibilities.  In  the  present  study,  17  ( 18%)  of  the  92 
patients  with  new  mutations  and  identified  constitutional 
NF2  mutations  were  somatic  mosaics.  The  estimated  pre¬ 
valence  of  somatic  mosaicism  in  NF2  patients  with  new 
mutations  is  25-30%."  12  Some  of  the  NF2  patients  with  new 
unfound  mutations  and  mild  disease  may  be  somatic  mosaics 
in  whom  conventional  DNA  sequencing  of  lymphocyte  DNA 
PCR  product  has  failed  to  identify  a  difference  from  the 
normal  sequence  because  the  mutant  allele  is  present  at  too 
low  a  level  to  be  detected.  Constitutional  NF2  large  deletions 


Table  2  Multivariate  probit  model  for  cataracts 
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Classical  NF2  45  i 

Somatic  mosaic  17 

Splice  site  39 

Missense  28 

Large  deletion  28 

People  with  new  mutations 
Inherited  cases  r  36 


Parameter  estimate 
(SE) 

0  15(0.58) 
0.11(0:15)  . 


-0  82np  j8jUP 

120  (028) 
-0.23  (0.34) 


1.00  0  98  to  1.02 

Reference  group 
071  024  to  1  !s0 

079  a  16  to  028 

079  ao°  to  028 

0.63  0.16  to  2.45 
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have  been  found  in  21%  of  NF2  families  using  microarray 
comparative  genomic  hybridisation/3  and  in  32%  of  NF2 
families  using  multiple  mutation  screening  methods/4 

The  intrafamilial  correlation  for  cataracts  was  weak  (and 
statistically  insignificant)  in  all  multivariate  probit  models 
that  were  tried,  although  there  were  relatively  few  families 
with  multiple  affected  relatives.  Several  other  clinical 
features  of  NF2  (age  at  onset  of  symptoms,  age  at  diagnosis, 
and  number  of  intracranial  meningiomas)  have  strong 
familial  correlations/5  The  prevalence  of  cataracts  in  the 
present  study  was  lower  than  in  other  studies,3-5  probably 
because  the  population-based  United  Kingdom  NF2  registry 
is  less  heavily  weighted  towards  NF2  patients  with  severe 
disease  than  studies  of  patients  from  tertiary  referral 
clinics/  5  and  because  some  cataract  examinations  were  done 
by  medical  specialists  other  than  ophthalmologists.  Non¬ 
ophthalmologists  may  miss  faint  cataracts,  but  in  such  cases 
it  is  unlikely  that  faint  cataracts  are  missed  more  frequently 
in  people  with  mild  NF2  than  in  those  with  severe  NF2  (that 
is,  it  will  not  bias  genotype-phenotype  correlations).  In  a 
previous  study  based  on  the  UK  NF2  registry,  all  patients 
were  examined  using  slitlamp  biomicroscopy  by  a  non¬ 
ophthalmologist,  and  the  prevalence  of  cataracts  was  similar 
in  mild  cases  (35%)  and  in  severe  cases  (40%). 9 

The  genotype-phenotype  correlations  for  cataracts  in  the 
present  study  extend  the  correlations  that  have  been  reported 
for  the  tumour  manifestations  of  NF2.  The  high  prevalence  of 
cataracts  in  young  NF2  patients,  and  their  frequent  occur¬ 
rence  before  the  tumour  manifestations  of  NF2,  underline  the 
importance  of  non-8th  nerve  signs  and  symptoms  of  NF2  in 
children  and  adolescents  as  a  useful  aid  to  diagnosis  in  this 
age  group.16 
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Neurofibromatosis  2  (NF2)  is  an  autosomal  dominant  disease  that  is  characterized  by  tumors  on  the  vestibular 
branch  of  the  VIII  cranial  nerve,  but  other  types  of  nervous  system  tumors  usually  occur  as  well.  Genotype-phenotype 
correlations  are  well  documented  for  overall  NF2  disease  severity  but  have  not  been  definitively  evaluated  for  specific 
types  of  non-Vin  nerve  tumors.  We  evaluated  genotype-phenotype  correlations  for  various  types  of  non-VHI  nerve 
tumors  in  406  patients  from  the  population-based  United  Kingdom  NF2  registry,  using  regression  models  with  the 
additional  covariates  of  current  age  and  type  of  treatment  center  (specialty  or  nonspecialty).  The  models  also 
permitted  consideration  of  intrafamilial  correlation.  We  found  statistically  significant  genotype-phenotype  corre¬ 
lations  for  intracranial  meningiomas,  spinal  tumors,  and  peripheral  nerve  tumors.  People  with  constitutional  NF2 
missense  mutations,  splice-site  mutations,  large  deletions,  or  somatic  mosaicism  had  significantly  fewer  tumors  than 
did  people  with  constitutional  nonsense  or  frameshift  NF2  mutations.  In  addition,  there  were  significant  intrafamilial 
correlations  for  intracranial  meningiomas  and  spinal  tumors,  after  adjustment  for  the  type  of  constitutional  NF2 
mutation.  The  type  of  constitutional  NF2  mutation  is  an  important  determinant  of  the  number  of  NF2-associated 
intracranial  meningiomas,  spinal  tumors,  and  peripheral  nerve  tumors. 


Introduction 

Neurofibromatosis  2  (NF2  [MIM  101000])  is  an  auto¬ 
somal  dominant  disease  that  is  caused  by  inactivating  mu¬ 
tations  of  the  NF2  tumor-suppressor  gene  (Rouleau  et  al. 
1993;  Trofatter  et  al.  1993).  The  clinical  abnormalities  of 
NF2  may  include  vestibular  schwannomas  (typically  bi¬ 
lateral),  intracranial  meningiomas,  spinal  tumors,  periph¬ 
eral  nerve  tumors,  and  ocular  abnormalities,  such  as  pre¬ 
sence  cataracts  (Evans  et  al.  1992a;  Parry  et  al.  1994; 
Mautner  et  al.  1 996b).  Genotype-phenotype  correlations 
for  overall  NF2  disease  severity  were  first  reported  in 
1995,  soon  after  the  identification  of  the  NF2  gene;  by 
1998,  these  correlations  were  well  established.  In  general, 
constitutional  NF2  nonsense  or  frameshift  mutations  are 
associated  with  severe  NF2,  and  missense  mutations,  in¬ 
frame  deletions,  and  somatic  mosaicism  are  associated 
with  mild  NF2  (Merel  et  al.  1995;  Kluwe  et  al.  1996, 
1998;  Parry  et  al.  1996;  Ruttledge  et  al.  1996;  Evans  et 
al.  1998^,  1998b;  Kluwe  and  Mautner  1998). 

Initial  genotype-phenotype  correlation  studies  of 
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NF2  were  limited  by  the  generality  of  the  definition  of 
disease  severity,  which  was  often  reported  only  as 
“mild,”  “moderate,”  or  “severe.”  The  mild  and  severe 
disease  categories  correspond  to  the  historical  nomen¬ 
clature  of  “Gardner”  (mild)  and  “Wishart”  (severe) 
subtypes,  which  were  based  on  the  clinical  observation 
that  the  severity  of  NF2  tends  to  “run  true”  within  a 
family  (Wishart  1822;  Gardner  and  Frazier  1930).  An¬ 
other  category,  “Lee-Abbott,”  which  corresponds  to 
very  severe  NF2,  was  not  consistently  adopted  by  sub¬ 
sequent  studies  (Lee  and  Abbott  1969). 

More  recent  genotype-phenotype  correlation  studies 
tend  to  use  indices,  such  as  age  at  onset  of  symptoms  of 
NF2,  that  portray  the  continuum  of  overall  disease  se¬ 
verity.  Age  at  onset  of  symptoms  reflects  the  burden  of 
the  many  different  types  of  clinical  abnormalities  in  NF2. 
In  patients  with  NF2  who  present  symptomatically,  the 
initial  symptom  is  related  to  vestibular  schwannomas  in 
1/3  to  2/3  of  patients  (Evans  et  al.  1992a;  Parry  et  al. 
1994;  Mautner  et  al.  19966).  Patients  with  NF2  whose 
initial  symptom  occurs  in  adulthood  tend  to  present  with 
Vm  nerve  symptoms,  whereas  the  initial  symptom  of 
children  with  NF2  is  often  related  to  clinical  abnor¬ 
malities  other  than  vestibular  schwannomas  (Mautner 
et  al.  1993;  MacCollin  and  Mautner  1998;  Evans  et  al. 
1999;  Nunes  and  MacCollin  2003). 

Parry  et  al.  (1994)  used  age  at  onset  of  NF2  symptoms 
as  a  criterion  to  define  overall  disease  severity  (aged  <20 
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years,  severe  disease;  aged  ^20  years,  mild  disease).  The 
importance  of  age  at  onset  of  NF2  symptoms  as  a  pre¬ 
dictor  of  subsequent  disease  course  has  been  validated 
in  longitudinal  studies.  Age  at  onset  of  symptoms  or 
age  at  diagnosis  are  the  most  important  predictors  of 
vestibular  schwannoma  growth  rates  in  NF2  (Baser  et 
al.  2002b;  Mautner  et  al.  2002)  and  of  the  risk  of  death 
in  patients  with  NF2  (Baser  et  al.  2002a;  Otsuka  et  al. 
2003). 

In  the  past  decade,  NF2  mutation  analysis  techniques 
have  been  refined,  and  the  definition  of  the  clinical  spec¬ 
trum  of  NF2  has  been  broadened.  However,  genotype- 
phenotype  correlation  studies  have  been  based  on  rela¬ 
tively  few  patients  because  of  the  rarity  of  the  disease 
(Evans  et  al.  1992b),  and  genotype-phenotype  correla¬ 
tions  have  been  established  only  for  overall  NF2  disease 
severity,  not  for  the  various  types  of  non-VUI  nerve  tu¬ 
mors  that  often  occur  in  NF2.  In  the  present  study,  we 
found  genotype-phenotype  correlations  for  the  number 
of  NF2-associated  intracranial  meningiomas,  spinal  tu¬ 
mors,  and  peripheral  nerve  tumors.  We  also  found  sig¬ 
nificant  correlations  within  families  for  intracranial  men¬ 
ingiomas  and  spinal  tumors. 

Methods 

Patients 

Data  for  this  study  were  obtained  from  the  popula¬ 
tion-based  United  Kingdom  NF2  registry  (Department 
of  Medical  Genetics,  St.  Mary’s  Hospital,  Manchester). 
Patients  with  NF2  are  ascertained  by  contacting  neuro¬ 
surgeons,  otolaryngologists,  neurologists,  pediatricians, 
dermatologists,  and  geneticists  throughout  the  United 
Kingdom,  augmented  in  the  North  West  Region  by  the 
Regional  Cancer  Registry.  The  study  was  subject  to  con¬ 
tinuing  ethics  committee  evaluation,  and  patients  con¬ 
sented  to  participation. 

People  were  eligible  for  inclusion  if  they  met  the  Man¬ 
chester  clinical  diagnostic  criteria  for  NF2  (Evans  et  al. 
1992a)  ( n  =  394)  or  did  not  meet  these  criteria  but  had 
an  identified  constitutional  NF2  mutation  (n  =  12).  We 
included  only  those  patients  who  had  been  screened  for 
constitutional  NF2  mutations  and  had  available  clinical 
information  from  imaging  studies  of  intracranial  men¬ 
ingiomas,  had  spinal  tumors  or  tumors  of  cranial  nerves 
other  than  the  VUI  nerve,  or  had  available  clinical  infor¬ 
mation  from  physical  examination  on  peripheral  nerve 
tumors.  The  analysis  excluded  several  groups  with  too 
few  people  to  analyze  separately:  somatic  mosaics  who 
had  identified  NF2  mutations  other  than  nonsense  or 
frameshift  mutations  (n  =  8)  and  people  with  consti¬ 
tutional  NF2  in-frame  deletions  (n  —  3)  or  chromoso¬ 
mal  translocations  involving  the  NF2  locus  (n  =  3).  The 
406  people  included  in  the  study  were  from  267  families 


(247  people  with  new  mutations  and  159  inherited  cases; 
200  females  and  206  males).  There  were  data  for  389 
people  with  intracranial  meningiomas,  351  people  with 
spinal  tumors,  332  people  with  peripheral  nerve  tumors, 
and  319  people  with  tumors  of  cranial  nerves  other  than 
the  VIII  nerve  (278  people  had  all  four  types  of  tumors). 
The  analyses  for  each  type  of  tumor  were  based  on  the 
people  who  had  data  for  that  specific  type  of  tumor. 

Detection  of  NF2  Mutations 

Samples  were  tested  for  the  presence  of  point  mu¬ 
tations  by  direct  sequencing  of  Meta-PCR  products 
(Wallace  et  al.  1999).  The  entire  cDNA  of  the  NF2 
gene  was  covered  in  four  Meta-PCR  reactions.  Gel- 
purified  Meta-PCR  products  were  then  sequenced  in 
both  orientations  by  use  of  BigDye  v2  chemistry,  ac¬ 
cording  to  the  manufacturer’s  instructions.  Sequencing 
electropherograms  were  scanned  for  mutations  by  use 
of  the  trace  subtraction  algorithm  “TraceDiff,”  which 
is  available  as  a  component  of  the  Staden  package.  In 
samples  in  which  mutations  were  identified,  the  pres¬ 
ence  of  mutations  was  confirmed  by  use  of  the  genomic 
DNA  sample  for  repetition  of  the  Meta-PCR  amplifi¬ 
cation  and  both  sequencing  reactions.  Large  deletions, 
nonsense  mutations,  frameshift  mutations,  and  muta¬ 
tions  affecting  invariate  splice-site  sequences  were  as¬ 
sumed  to  be  pathogenic.  Missense  changes  were  de¬ 
fined  as  pathogenic  if  they  had  been  published  and 
shown  to  affect  NF2  expression  or  function.  For  other 
missense  changes,  parental  samples  were  analyzed;  if 
the  mutation  arose  de  novo  in  the  patient,  this  was 
taken  as  strong  evidence  of  pathogenicity. 

Dosage  analysis  was  performed  using  a  quantitative 
PCR-based  assay  in  10-mL  volumes  with  100  ng  ge¬ 
nomic  DNA.  Exons  1,  4,  8,  and  15  from  the  NF2  gene 
and  three  amplicons  derived  from  the  cystic  fibrosis  gene 
were  amplified  within  a  multiplex  PCR.  The  NF2  prim¬ 
ers  and  control  primers  all  had  identical  "universal”  for¬ 
ward  and  reverse  sequences  at  their  5'  termini  and  were 
present  at  low  concentration  in  the  PCR  reaction  (50 
nM).  Universal  forward  and  reverse  primers  were  also 
included  but  at  a  higher  concentration  (0.5  mM)  than  the 
genomic-specific  primers.  The  universal  forward  primer 
was  5/-labeled  with  the  fluorescent  dye  FAM,  thereby 
allowing  fluorescent  quantification  of  the  PCR  products 
from  each  amplicon.  The  PCR  reactions  were  arrested 
at  23  cycles  to  maintain  the  amplification  reaction  within 
the  logarithmic  phase.  Electrophoresis  and  quantitative 
measurement  of  yield  of  each  product  was  performed  on 
an  ABB  100  Genetic  Analyzer  with  Genescan  and  Geno- 
typer  software.  Relative  peak  heights  of  all  amplicons 
of  each  test  sample  were  compared  with  a  normalized 
average  of  five  negative  controls  by  use  of  the  dosage 
quotient  (DQ)  method  (Yau  et  al.  1996).  Calculations 
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Table  1 

Characteristics  of  406  People  with  NF2,  by  Type  of  NF2  Mutation 

Type  of  NF2  Mutation 


Unfound  Mutations 


People  with  New 
Mutations,  with  Age 
at  Onset  of 


Nonsense  or  Frameshift 

Splice 

Site 

Large 

Deletion 

Inherited 

Symptoms 

Characteristic 

Classical 

Somatic  Mosaic 

Missense 

Cases 

<20  years 

2*20  years 

No.  of  people/families 

111/75 

23 

78/39 

26/8 

48/20 

25/7 

29 

66 

Age  in  years  (mean  ±  SD): 

At  onset  of  symptoms8 

19  ±12 

30±  11 

23  ±12 

32  ±17 

24  ±13 

22±  13 

13  ±  8 

36±  12 

At  diagnosis 

23  ±13 

37±11 

28  ±15 

41±21 

28  ±14 

26±  13 

22  ±10 

43  ±13 

Vestibular  schwannoma  (%): 

None 

7 

0 

13 

0 

2 

13 

3 

3 

Unilateral 

6 

26 

3 

19 

4 

0 

3 

20 

Bilateral 

87 

74 

84 

81 

94 

87 

94 

77 

Intracranial  meningioma  {%) 

53 

65 

38 

22 

42 

30 

62 

48 

Peripheral  nerve  tumors  (%) 

66 

39 

52 

25 

54 

79 

75 

39 

Spinal  tumors  {%) 

69 

57 

47 

57 

33 

62 

73 

45 

Tumors  of  non-VIII  nerve  cranial  nerves  (%) 

45  . 

27 

33 

24 

39 

28 

54 

28 

a  Excludes  20  inherited  cases  who  were  asymptomatic  at  the  time  of  diagnosis  of  NF2. 


were  performed  on  Microsoft  Excel  spreadsheets  written 
specifically  for  this  purpose.  A  sample  from  a  patient 
with  an  NF2  deletion  was  included  as  a  control  within 
each  run  of  samples.  If  only  a  single  NF2  measurement 
showed  a  DQ  measurement  within  the  deleted  range 
(0.35-0.65),  the  test  was  repeated.  If  results  of  the  re¬ 
peated  test  agreed  with  the  first  result,  the  deleted  exon 
was  amplified  and  sequenced  with  primers  £hat  lay  ex¬ 
ternal  to  the  dosage  primer  binding  sites,  to  exclude  the 
possibility  that  a  polymorphism  of  the  primer  binding 
site  had  affected  amplification  efficiency. 

Magnetic  Resonance  Imaging  (MRI)  of  the  Brain 
and  Spine 

Each  patient  had  an  MRI  examination  of  the  brain 
and  full  spinal  cord  at  the  time  of  NF2  diagnosis.  Sub¬ 
sequent  MRI  examinations  were  performed  as  indicated 
clinically  or  for  purposes  of  routine  follow-up,  as  de¬ 
termined  by  the  patient’s  physician.  The  brain  precon¬ 
trast  MRI  protocol  included  sagittal  T1 -weighted  spin 
echo  imaging  (5-mm  slices)  and  axial  T2-weighted  turbo 
spin  echo  imaging  (5-mm  slices).  The  brain  postcontrast 
protocol  included  axial  Tl-weighted  spin  echo  imaging 
(3-mm  slices  through  the  posterior  fossa;  5-mm  slices 
through  the  rest  of  the  brain)  and  coronal  Tl-weighted 
spin  echo  imaging  (5-mm  slices  through  the  whole  brain). 
If  the  patient  had  a  previous  translabyrinthine  removal 
of  a  vestibular  schwannoma,  then  the  postcontrast  axial 
3-mm  slices  were  done  with  fat  suppression.  The  spinal 
canal  postcontrast  MRI  protocol  included  sagittal  and 
coronal  Tl-weighted  spin  echo  imaging  (4-mm  slices). 
Axial  Tl-weighted  spin  echo  images  and  sagittal  T2- 


weighted  turbo  spin  echo  images  also  were  included,  if 
indicated.  The  number  of  spinal  tumors,  as  counted  by 
the  radiologist  (J.E.G.),  was  confirmed  by  another  phy¬ 
sician  (D.G.R.E.),  who  reread  the  spinal  MRIs. 

Statistical  Analysis 

The  cross-sectional  associations  of  the  type  of  consti¬ 
tutional  NF2  mutation  with  the  various  kinds  of  nervous 
system  tumors  were  assessed  by  regression  models.  Nu¬ 
merical  maximum  likelihood  was  estimated  using  the 
quasi-Newton  method.  Computations  were  performed 
using  C  programs  developed  at  the  Department  of  Sta¬ 
tistics,  University  of  British  Columbia. 

The  models  were  chosen  on  the  basis  of  the  nature  of 
the  dependent  variable.  A  gamma  mixture  of  negative 
binomials  model  was  used  to  model  the  association  of 
the  type  of  constitutional  NF2  mutation  with  the  number 
of  intracranial  meningiomas,  spinal  tumors,  and  pe¬ 
ripheral  nerve  tumors,  because  the  count  distribution 
of  these  types  of  tumors  was  heavily  right  skewed.  A 
probit  model  was  used  to  evaluate  the  association  of 
the  type  of  constitutional  NF2  mutation  with  the  pres¬ 
ence  or  absence  of  tumors  of  cranial  nerves  other  than 
the  VIII  nerve.  In  addition,  all  of  the  models  had  an 
exchangeable  correlation  structure  within  families  that 
permitted  assessment  of  intrafamilial  effects  beyond  those 
produced  by  the  type  of  constitutional  NF2  mutation. 
The  other  covariates  were  the  potential  confounders  of 
current  age  (when  tumor  burden  was  most  recently  as¬ 
sessed  by  MRI)  and  type  of  treatment  center  (specialty 
or  nonspecialty).  The  genotype-phenotype  correlation 
tables  present,  for  each  type  of  NF2  mutation,  the  av- 
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Table  2 

Genotype-Phenotype  Correlations  for  Intracranial  Meningiomas 


Am.  ).  Hum.  Genet.  75:000-000 ,  2004 


Type  of  NF2  Mutation4 

Sample  Average  of  No. 
of  Intracranial 
Meningiomas 

Model-Based  Estimated 
Average  of  No. 
of  Intracranial 
Meningiomas 

95%  Cl 

for  Count  Ratios1* 

Nonsense  or  frameshift: 
Classical  NF2 

1.2 

1.4 

1.00c 

Somatic  mosaic 

2.8 

2.5 

,74-3.85 

Splice-site 

1.0 

.9 

.34-1.12 

Missense 

.3 

.3 

.06-.  64 

Large  deletion 

.9 

1.0 

.30-1.43 

Unfound  mutations: 

Inherited  cases 

.7 

.8 

.22-1.30 

People  with  new  mutations: 
Age  at  onset  <20  years 

1.9 

2.2 

.77-2.92 

Age  at  onset  s*20  years 

1.6 

1.2 

.41-1.63 

Note. — Statistically  significant  results  are  shown  in  bold  italics. 

8  Other  covariates:  Intrafamilial  correlation  coefficient  =  .24  (95%  Cl  .10-.39).  Current  age  (per  year): 
1.01  (95%  Cl  1.00-1.02).  Type  of  treatment  center  (specialty  vs.  nonspecialty):  .92  (95%  Cl  .61-1.40). 
b  Compared  with  people  with  classical  NF2  and  nonsense  or  frameshift  mutations. 
c  Reference  group. 


erage  number  or  prevalence  of  tumors  in  the  sample  and 
the  model-based  number  or  prevalence  of  tumors  that 
were  estimated  using  maximum  likelihood,  as  described 
above. 

The  type  of  mutation  variable  was  categorical  and 
coded  as  eight  binary  variables.  The  variables  were  in¬ 
dicators  of  somatic  mosaicism  (defined  at  the  molecular 
level  and  including  only  mosaics  with  nonsense  or 
frameshift  mutations),  splice-site  mutations,  missense 
mutations,  large  deletions,  and  unfound  mutations,  each 
compared  with  nonsense  or  frameshift  mutations.  Since 
a  large  proportion  of  patients  with  NF2  (especially  those 
with  new  mutations)  have  unfound  mutations,  people 
with  unfound  mutations  were  divided  into  three  sub¬ 
groups:  inherited  cases,  people  with  new  mutations  and 
onset  of  symptoms  of  NF2  at  age  <20  years  (i.e.,  severe 
disease),  and  people  with  new  mutations  and  onset  of 
symptoms  of  NF2  at  age  ^20  years  (i.e.,  mild  disease). 

People  who  met  the  Manchester  clinical  diagnostic 
criteria  for  NF2  (Evans  et  al.  1992 a)  and  who  had  full 
constitutional  NF2  nonsense  or  frameshift  mutations 
were  the  reference  group  in  comparisons  between  dif¬ 
ferent  types  of  NF2  mutations.  Genotype-phenotype  cor¬ 
relations  were  evaluated  using  tumor  count  ratios  or 
relative  risks  (RR)  and  their  95%  CIs  from  the  models. 
Count  ratios  or  RR  with  95%  CIs  that  excluded  zero 
were  considered  to  be  statistically  significant. 

Results 

The  characteristics  of  the  study  population  are  presented 
in  table  1.  People  with  somatic  mosaicism  or  constitu¬ 
tional  NF2  splice-site  mutations,  missense  mutations,  or 


large  deletions  were  significantly  older  at  onset  of  NF2 
symptoms  and  at  NF2  diagnosis  than  were  people  with 
classical  NF2  and  constitutional  NF2  nonsense  or 
frameshift  mutations.  The  results  of  the  genotype-phe¬ 
notype-correlation  analyses  for  the  various  types  of  ner¬ 
vous  system  tumors  are  presented  in  tables  2-5.  Relative 
to  people  with  classical  NF2  and  constitutional  nonsense 
or  frameshift  NF2  mutations,  there  tended  to  be  fewer 
tumors  in  somatic  mosaics  and  in  people  with  constitu¬ 
tional  NF2  splice-site  mutations,  missense  mutations, 
or  large  deletions  (see  appendix  [online  only]  for  list 
of  unique  NF2  mutations). 

People  with  classical  NF2  and  nonsense  or  frameshift 
mutations  had  a  model-based  estimated  average  of  1.4 
meningiomas  (table  2).  There  were  significantly  fewer  (an 
average  of  0.3)  meningiomas  in  people  with  missense 
mutations. 

There  were  extensive  genotype-phenotype  correlations 
for  peripheral  nerve  tumors  (table  3).  People  with  clas¬ 
sical  NF2  and  nonsense  or  frameshift  mutations  had  a 
model-based  estimated  average  of  4.0  peripheral  nerve 
tumors.  There  were  significantly  fewer  peripheral  nerve 
tumors  in  somatic  mosaics  (an  average  of  1.4)  and  in 
people  with  splice-site  mutations  (an  average  of  1.3), 
missense  mutations  (an  average  of  1.6),  or  large  dele¬ 
tions  (an  average  of  1.6). 

People  with  classical  NF2  and  nonsense  or  frameshift 
mutations  had  a  model-based  estimated  average  of  4.1 
spinal  tumors  (table  4).  There  were  significantly  fewer 
spinal  tumors  in  people  with  splice-site  mutations  (an 
average  of  2.1)  or  large  deletions  (an  average  of  0.8). 
There  were  fewer  spinal  tumors  in  somatic  mosaics,  but 
not  significantly  so. 
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Table  3 

Genotype-Phenotype  Correlations  for  Peripheral  Nerve  Tumors 


Type  of  NF2  Mutation3 

Sample  Average  of  No. 
of  Peripheral  Nerve 
Tumors 

Model-Based  Estimated 
Average  of  No. 
of  Peripheral  Nerve 
Tumors 

95%  Cl 

for  Count  Ratiosb 

Nonsense  or  frameshift: 

Classical  NF2 

3.6 

4.0 

1.00c 

Somatic  mosaic 

1.4 

1A 

.16-.73 

Splice-site 

13 

13 

.19-.58 

Missense 

1.5 

1.6 

.18-.93 

Large  deletion 

1.6 

1.6 

.21-.79 

Unfound  mutations: 

Inherited  cases 

2.6 

2.8 

.33-1.44 

People  with  new  mutations: 

Age  at  onset  <20  years 

4.X 

4.3 

.58-2.02 

Age  at  onset  s=20  years 

1.4 

1A 

.20-.  60 

Note. — Statistically  significant  results  are  shown  in  bold  italics. 

a  Other  covariates:  Intrafamilial  correlation  coefficient  =  .13  (95%  Cl  -.03  to  .29).  Current  age  (per 
year):  .99  (95%  Cl  .98-1.01).  Type  of  treatment  center  (specialty  vs.  nonspecialty):  1.19  (95%  Cl  .75-1.87). 
b  Compared  with  people  with  classical  NF2  and  nonsense  or  frameshift  mutations. 
c  Reference  group. 


Tumors  occurred  on  all  cranial  nerves  other  than  the 
Vm  nerve  but  were  most  common  on  the  V  nerve  (13% 
of  subjects),  the  XU  nerve  (7%),  and  the  II,  VII,  and  X 
nerves  (3% — 4%  each).  Tumors  of  cranial  nerves  other 
than  the  VIII  nerve  were  analyzed  in  the  aggregate,  be¬ 
cause  the  prevalence  of  tumors  on  individual  cranial 
nerves  was  low.  In  people  with  classical  NF2  and  non¬ 
sense  or  frameshift  mutations,  the  model-based  esti¬ 
mated  prevalence  of  tumors  of  cranial  nerves  other  than 
the  Vm  nerve  was  45%  (table  5).  There  were  not  sig¬ 
nificant  genotype-phenotype  correlations. 

In  addition  to  genotype-phenotype  correlations  for 
specific  types  of  mutations,  people  with  new  unfound 
mutations  and  older  age  at  onset  of  symptoms  tended 
to  have  milder  disease  than  people  with  classical  NF2 
and  nonsense  or  frameshift  mutations.  Compared  with 
the  reference  group,  people  with  new  mutations  and  older 
age  at  onset  of  symptoms  had  a  significantly  lower 
model-based  estimated  number  of  peripheral  nerve  tu¬ 
mors  and  spinal  tumors  (tables  3  and  4).  The  model- 
based  estimated  prevalence  of  tumors  of  cranial  nerves 
other  than  the  VIII  nerve  was  lower,  but  not  significantly 
so. 

Other  covariates  contributed  significantly  to  the  mod¬ 
els  for  meningiomas  and  spinal  tumors,  after  adjustment 
for  the  type  of  constitutional  NF2  mutation.  There  were 
significant  intrafamilial  correlations  for  meningiomas  (an 
average  of  0.24)  and  spinal  tumors  (an  average  of 
0.21).  The  number  of  meningiomas  increased  signifi¬ 
cantly  with  increasing  age  (1.01  per  year  of  age),  and 
there  were  significantly  more  spinal  tumors  on  average 
(2.4)  in  people  who  were  seen  in  specialty  centers  than 
in  those  who  were  seen  in  nonspecialty  centers. 


Discussion 

There  are  genotype-phenotype  correlations  for  the  num¬ 
ber  of  NF2-associated  intracranial  meningiomas,  spinal 
tumors,  and  peripheral  nerve  tumors.  These  results  ex¬ 
tend  the  genotype-phenotype  correlations  that  have  been 
reported  for  overall  NF2  disease  severity  (Merel  et  al. 
1995;  Kluwe  et  al.  1996,  1998;  Parry  et  al.  1996;  Rutt- 
ledge  et  al.  1996;  Evans  et  al.  1998*,  19986;  Kluwe  and 
Mautner  1998).  The  present  study  and  other  genotype- 
phenotype  correlation  studies  that  used  data  from  the 
United  Kingdom  NF2  registry  (Evans  et  al.  1998#;  Baser 
et  al.  2003)  are  based  on  patient  information  from  a 
variety  of  types  of  clinics  throughout  the  United  King¬ 
dom,  whereas  other  such  studies  often  have  been  based 
on  patients  from  neurofibromatosis  clinics  (Merel  et  al. 
1995;  Kluwe  et  al.  1996,  1998;  Parry  et  al.  1996;  Rut- 
tledge  et  al.  1996).  Studies  that  are  based  on  patients 
from  specialty  clinics  may  be  biased  toward  people  with 
more  severe  disease,  and  there  may  be  other  biases  from 
specialized  referral  patterns. 

Intracranial  meningiomas  occur  in  about  half  of  pa¬ 
tients  with  NF2  (Evans  et  al.  1992a;  Parry  et  al.  1994; 
Mautner  et  al.  19966).  The  present  study  demonstrates 
that  the  penetrance  of  meningiomas  increases  with  in¬ 
creasing  age,  after  adjustment  for  the  type  of  NF2  mu¬ 
tation  in  a  regression  model.  The  genotype-phenotype 
correlation  of  missense  mutations  with  meningiomas  is 
of  interest  because  missense  mutations  produce  NF2 
protein  (termed  “merlin”  or  “schwannomin”)  that  is 
defective  in  negative  growth  regulation,  whereas  non¬ 
sense  mutations  do  not  produce  stable  merlin  (Gutmann 
et  al.  1998).  Missense  mutations  also  produce  merlin 
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Table  4 

Genotype-Phenotype  Correlations  for  Spinal  Tumors 


Am.  J.  Hum.  Genet.  75:000-000 ,  2004 


Type  of  NF2  Mutation® 

Sample  Average  of  No. 
of  Spinal  Tumors 

Model-Based  Estimated 
Average  of  No. 
of  Spinal  Tumors 

95%  Cl  for  Count 
Ratios1* 

Nonsense  or  frameshift: 

Classical  NF2 

4.1 

4.1 

1.00c 

Somatic  mosaic 

2.1 

1.7 

.21-1.16 

Splice  site 

2.1 

2.1 

.26-34 

Missense 

2.0 

2.2 

.23-1.99 

Large  deletion 

. 8 

.8 

.10-.52 

Unfound  mutations: 

Inherited  cases 

1.9 

1.6 

.15-1.06 

People  with  new  mutations: 

Age  at  onset  <20  years 

6.1 

4.5 

.72-3.32 

Age  at  onset  >20  years 

1.1 

1.1 

.13-50 

Note, — Statistically  significant  results  are  shown  in  bold  italics. 

*  Other  covariates:  Intrafamilial  correlation  coefficient  =  .21  (95%  Cl  .10-.32).  Current  age  (per  year):  1.00 
(95%  Cl  .97-1.03).  Type  of  treatment  center  (specialty  vs.  nonspecialty):  2.42  (95%  Cl  1.62-8.70). 
b  Compared  with  people  with  classical  NF2  and  nonsense  or  frameshift  mutations. 
c  Reference  group. 


with  reduced  but  not  absent  binding  with  /311-spectrin 
(Scoles  et  al.  1998),  which  may  be  caused  by  complex 
conformational  changes  that  alter  merlin  folding  and 
affect  access  to  a  binding  site  (Scoles  et  al.  2002). 

Spinal  tumors  are  found  in  up  to  90%  of  patients  with 
NF2  when  the  full  spine  is  scanned  by  MRI  (Mautner 
et  al.  1995).  In  the  present  study,  patients  with  NF2 
who  had  been  seen  at  specialty  treatment  centers  had  a 
significantly  higher  number  of  spinal  tumors,  after  ad¬ 
justment  for  the  type  of  NF2  mutation  and  other  co¬ 
variates  in  a  regression  model.  Patients  with  NF2  who 
are  seen  at  specialty  treatment  centers  may  be  more 
likely  to  have  full  spine  MRI,  which  would  lead  to  the 
association  of  specialty  treatment  centers  with  a  higher 
number  of  spinal  tumors.  Intradural  and  extradural  ex¬ 
tramedullary  spinal  tumors  in  patients  with  NF2  are 
usually  schwannomas  or  meningiomas,  whereas  intra¬ 
medullary  tumors  are  usually  ependymomas  or  astro¬ 
cytomas.  A  previous  study  noted  that  intramedullary 
spinal  tumors  were  less  common  in  NF2-affected  patients 
with  several  types  of  NF2  mutations  in  the  aggregate 
(splice-site  mutations,  missense  mutations,  or  in-frame 
deletions)  than  in  those  with  nonsense  or  frameshift 
mutations  (Patronas  et  al.  2001).  However,  specific  ge¬ 
notype-phenotype  correlations  were  not  found  in  that 
study. 

Peripheral  nerve  tumors  occur  in  about  half  of  people 
with  NF2  (Mautner  et  al.  1997).  The  prevalence  of  pe¬ 
ripheral  nerve  tumors  is  significantly  less  in  people  with 
mild  NF2  than  in  those  with  severe  NF2  (Mautner  et 
al.  1997).  This  is  suggestive  of  genotype-phenotype  cor¬ 
relations,  but,  prior  to  this  study,  such  correlations  had 
not  been  evaluated. 


Spinal  tumors  and  peripheral  nerve  tumors  were  less 
common  in  people  with  somatic  mosaicism  than  in  peo¬ 
ple  with  classical  NF2  and  constitutional  NF2  nonsense 
or  frameshift  mutations.  Each  of  these  types  of  tumors 
was  also  less  common  in  people  with  new  unfound  mu¬ 
tations  and  older  age  at  onset  of  symptoms  than  in 
people  in  the  reference  group.  The  latter  association  is 
not  a  specific  genotype-phenotype  correlation,  because 
patients  with  unfound  NF2  mutations  may  have  differ¬ 
ent  types  of  mutations.  However,  it  is  likely  that  most 
of  these  patients  have  somatic  mosaicism.  A  similar  pat¬ 
tern  occurs  in  NF2-associated  cataracts,  which  are  less 
common  in  people  with  somatic  mosaicism  or  new  un¬ 
found  mutations  and  older  age  at  onset  of  symptoms 
than  in  people  with  classical  NF2  and  constitutional  NF2 
nonsense  or  frameshift  mutations  (Baser  et  al.  2003). 

Somatic  mosaicism  occurs  in  an  estimated  25%-30% 
of  patients  with  NF2  who  have  new  mutations  (Evans 
et  al.  1998 b;  Kluwe  and  Mautner  1998;  Kluwe  et  al. 
2003;  Moyhuddin  et  al.  2003).  The  relatively  low  ef¬ 
ficiency  (~60%)  of  conventional  mutation  screening 
techniques  in  detecting  constitutional  NF2  mutations  in 
patients  with  NF2  who  have  new  mutations  has  been 
attributed  to  somatic  mosaicism.  In  somatic  mosaics, 
conventional  DNA  sequencing  of  lymphocyte  DNA 
PCR  product  often  fails  to  identify  a  difference  from 
the  normal  sequence  because  the  mutant  allele  is  present 
at  too  low  a  level  to  be  detected.  In  this  study,  somatic 
mosaics  with  nonsense  or  frameshift  mutations  had  sig¬ 
nificantly  fewer  peripheral  nerve  tumors  than  did  people 
with  full  constitutional  NF2  nonsense  or  frameshift 
mutations. 

In  this  study,  spinal  tumors  and  peripheral  nerve  tu- 
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Table  5 


Genotype-Phenotype  Correlations  for  Non-VlII  Nerve  Cranial  Nerve  Tumors 


Type  of  NF2  Mutation" 

Sample  Average  of 
Prevalence  of  Non-VIII 
Nerve  Cranial  Nerve 
Tumors  (%) 

Model-Based  Estimated 
Prevalence  of  Non-VIII 
Nerve  Cranial  Nerve 
Tumors  (%) 

RRb 

95%  Cl 

Nonsense  or  frameshift: 
Classical  NF2 

44 

45 

1.00c 

Somatic  mosaic 

27 

27 

.37 

.10-1.37 

Splice-site 

33 

35 

.60 

.23-1.57 

Missense 

24 

24 

.31 

.06-1.62 

Large  deletion 

41 

42 

.85 

.28-2.58 

Unfound  mutations: 

Inherited  cases 

28 

24 

.31 

.07-1.44 

People  with  new  mutations 
Age  at  onset  <20  years 

54 

54 

1.54 

.51^4.62 

Age  at  onset  >20  years 

28 

28 

.39 

.15-1.01 

a  Other  covariates:  Intrafamilial  correlation  coefficient  =  .36  (95%  Cl  -.13  to  .64).  Current  age  (per 
year):  1.00  (95%  Cl  .85-1.17).  Type  of  treatment  center  (specialty  vs.  nonspecialty):  1.00  (95%  Cl  .28- 
3.52). 


b  Compared  with  people  with  classical  NF2  and  nonsense  or  frameshift  mutations. 
c  Reference  group. 


mors  were  less  common  in  people  with  constitutional 
NF2  large  deletions  than  in  people  with  constitutional 
NF2  nonsense  or  frameshift  mutations.  There  have  been 
other  reports  of  mild  NF2  in  people  with  constitutional 
NF2  large  deletions  (Lopez-Correa  et  al.  2000;  Bruder 
et  al.  2001).  Constitutional  NF2  large  deletions  are  found 
in  21%-32%  of  families  with  NF2,  when  multiple 
screening  methods  or  microarray-comparative  genomic 
hybridization  are  used  (Zucman-Rossi  et  al.  1998;  Bru¬ 
der  et  al.  2001).  Conventional  mutation  screening  tech¬ 
niques  such  as  SSCP  analysis  are  unable  to  detect  large 
deletions  in  autosomal  genes  because  those  methods  are 
not  designed  to  measure  gene  dosage.  When  a  large 
deletion  is  present,  the  normal  allele  alone  is  amplified 
and  analyzed;  consequently,  the  deletion  is  cryptic  to 
the  analytical  method.  A  gene  dosage  method  was  used 
to  detect  large  deletions  in  this  study. 

Historically,  NF2  disease  severity  was  said  to  “run 
true”  in  families,  but  some  studies  noted  marked  intra¬ 
familial  clinical  variability  (Kluwe  and  Mautner  1996; 
Mautner  et  al.  1 996a;  Scoles  et  al.  1996).  In  a  previous 
study  based  on  an  international  NF2  clinicomolecular 
database,  we  found  that  there  were  strong  intrafamilial 
correlations  for  age  at  onset  of  hearing  loss  (all  mutation 
types,  0.51;  families  with  missense  mutations,  0.67), 
and  there  were  significant  but  weaker  intrafamilial  cor¬ 
relations  for  intracranial  meningiomas  (all  mutation 
types,  0.29;  families  with  splice-site  mutations,  0.39) 
(Zhao  et  al.  2002).  The  present  study  confirms  this  con¬ 
clusion  with  respect  to  intracranial  meningiomas  in  a 
partially  overlapping  data  set  and  additionally  demon¬ 
strates  that  there  is  a  significant  (although  weak)  intra¬ 


familial  correlation  for  spinal  tumors.  These  intrafam¬ 
ilial  correlations  were  observed  in  regression  models 
after  adjustment  for  the  type  of  mutation,  and  it  is  pos¬ 
sible  that  the  correlations  reflect  differences  in  the  phe¬ 
notypic  effects  produced  by  individual  NF2  alleles  that 
comprise  a  mutation  type.  It  is  also  possible  that  the 
intrafamilial  correlations  are  evidence  of  other  genetic 
loci  that  affect  the  NF2  phenotype.  Other  evidence  sug¬ 
gests  that  modifying  genes  may  affect  the  clinical  mani¬ 
festations  of  NF2  (Bruder  et  al.  1999). 

The  type  of  constitutional  NF2  mutation  is  an  im¬ 
portant  determinant  of  the  number  of  NF2-associated 
intracranial  meningiomas,  spinal  tumors,  and  peripheral 
nerve  tumors.  Constitutional  NF2  splice-site  mutations 
may  have  different  effects,  depending  on  the  location 
within  the  gene;  that  is,  3'  mutations  may  cause  less 
severe  NF2  than  do  5'  mutations  (Kluwe  et  al.  1998). 
Recent  longitudinal  studies  have  found  that  the  age  at 
onset  of  symptoms  of  NF2  and  age  at  diagnosis  of  NF2 
are  the  strongest  predictors  of  vestibular  schwannoma 
growth  rates  (Baser  et  al.  20026;  Mautner  et  al.  2002) 
and  the  risk  of  mortality  (Baser  et  al.  2002^;  Otsuka  et 
al.  2003).  People  with  constitutional  NF2  missense  mu¬ 
tations  have  a  lower  risk  of  mortality  than  do  those 
with  constitutional  NF2  nonsense  or  frameshift  muta¬ 
tions  (Baser  et  al.  2002a).  Additional  longitudinal  ge¬ 
notype-phenotype  correlation  studies  of  associations  be¬ 
tween  the  type  of  constitutional  NF2  mutation  and  the 
development  of  specific  types  of  NF2-associated  tumors 
may  provide  information  that  is  useful  for  the  clinical 
management  of  people  with  NF2. 


000 


Am.  J.  Hum.  Genet.  75:000-000, 2004 


Acknowledgments 

This  work  was  supported,  in  part,  by  the  FBT  Foundation. 

We  thank  the  patients  with  NF2  and  their  families  for  partici¬ 
pating  in  this  study. 

Electronic-Database  Information 

The  URLs  for  data  presented  herein  are  as  follows: 

Online  Mendelian  Inheritance  in  Man  (OMIM),  http://www 
.ncbi.nlm.nih.gov/Omim/  (for  NF2) 

Staden,  http://staden.sourceforge.net/  (for  the  trace  subtraction 
algorithm  “TraceDiff”) 

References 

Baser  ME,  Friedman  JM,  Aeschliman  D,  Joe  H,  Wallace  AJ, 
Ramsden  RT,  Evans  DGR  (2002#)  Predictors  of  the  risk  of 
mortality  in  neurofibromatosis  2.  Am  J  Hum  Genet  71:715- 
723 

Baser  ME,  Kuramoto  L,  Joe  H,  Friedman  JM,  Wallace  AJ, 
Ramsden  RT,  Evans  DGR  (2003)  Genotype-phenotype  cor¬ 
relations  for  cataracts  in  neurofibromatosis  2.  J  Med  Genet 
40:758-760 

Baser  ME,  Makariou  EV,  Parry  DM  (2002 b)  Predictors  of 
vestibular  schwannoma  growth  in  patients  with  neurofibro¬ 
matosis  type  2.  J  Neurosurg  96:217-222 

Bruder  CEG,  Hirvela  C,  Tapia-Paez  I,  Fransson  I,  Segraves  R, 
Hamilton  G,  Zhang  XX,  et  al  (2001)  High  resolution  dele¬ 
tion  analysis  of  constitutional  DNA  from  neurofibromatosis 
type  2  (NF2)  patients  using  microarray-CGH.  Hum  Mol  Ge¬ 
net  10:271-282 

Bruder  CEG,  Ichimura  K,  Blennow  E,  Ikeuchi  T,  Yamaguchi 
T,  Yuasa  Y,  Collins  VP,  Dumanski  JP  (1999)  Severe  pheno¬ 
type  of  neurofibromatosis  type  2  in  a  patient  with  a  7.4-MB 
constitutional  deletion  on  chromosome  22:  possible  location 
of  a  neurofibromatosis  type  2  modifier  gene?  Genes  Chromo¬ 
somes  Cancer  25:184-190 

Evans  DGR,  Birch  JM,  Ramsden  RT  (1999)  Paediatric  presen¬ 
tation  of  type  2  neurofibromatosis.  Arch  Dis  Child  81:496- 
499 

Evans  DGR,  Huson  SM,  Donnai  D,  Neary  W,  Blair  V,  Newton 
V,  Harris  R  (1991a)  A  clinical  study  of  type  2  neurofibroma¬ 
tosis.  Q  J  Med  84:603-618 

Evans  DGR,  Huson  SM,  Donnai  D,  Neary  W,  Blair  V,  Teare 
D,  Newton  V,  Strachan  T,  Ramsden  R,  Harris  R  (1991b)  A 
genetic  study  of  type  2  neurofibromatosis  in  the  United  King¬ 
dom.  I.  Prevalence,  mutation  rate,  fitness,  and  confirmation 
of  maternal  transmission  effect  on  severity.  J  Med  Genet  29: 
841-846 

Evans  DGR,  Trueman  L,  Wallace  A,  Collins  S,  Strachan  T 
(19 98a)  Genotype/phenotype  correlations  in  type  2  neuro¬ 
fibromatosis  (NF2):  evidence  for  more  severe  disease  asso¬ 
ciated  with  truncating  mutations.  J  Med  Genet  35:450-455 

Evans  DGR,  Wallace  AJ,  Wu  CL,  Trueman  L,  Ramsden  RT, 
Strachan  T  (1998 b)  Somatic  mosaicism:  a  common  cause  of 
classic  disease  in  tumor-prone  syndromes?  Lessons  from  type 
2  neurofibromatosis.  Am  J  Hum  Genet  63:727-736 


Gardner  WJ,  Frazier  CH  (1930)  Bilateral  acoustic  neuromas: 
a  clinical  study  and  field  survey  of  a  family  of  five  genera¬ 
tions  with  bilateral  deafness  in  thirty-eight  members.  Arch 
Neurol  Psychiatr  23:266-302 

Gutmann  DH,  Geist  RT,  Xu  H-m,  Kim  JS,  Sapor ito-Irwin  S 
(1998)  Defects  in  neurofibromatosis  2  protein  function  can 
arise  at  multiple  levels.  Hum  Mol  Genet  7:335-345 
Kluwe  L,  Beyer  S,  Baser  ME,  Hazim  W,  Haase  W,  Fiinsterer 
C,  Mautner  V-F  (1 996)  Identification  of  NF2  germ-line  mu¬ 
tations  and  comparison  with  NF2  phenotypes.  Hum  Genet 
98:534-538 

Kluwe  L,  MacCollin  M,  Tatagiba  M,  Thomas  S,  Hazim  W, 
Haase  W,  Mautner  V-F  (1998)  Phenotypic  variability  asso¬ 
ciated  with  14  splice-site  mutations  in  the  NF2  gene.  Am  J 
Med  Genet  77:228-233 

Kluwe  L,  Mautner  V-F  (1996)  A  missense  mutation  in  the  NF2 
gene  results  in  moderate  and  mild  clinical  phenotypes  of  neu¬ 
rofibromatosis  type  2.  Hum  Genet  97:224-227 
-  (1998)  Mosaicism  in  sporadic  neurofibromatosis  2  pa¬ 
tients.  Hum  Mol  Genet  7:2051-2055 
Kluwe  L,  Mautner  V-F,  Heinrich  B,  Dezube  R,  Jacoby  LB, 
Friedrich  RE,  MacCollin  M  (2003)  Molecular  study  of  fre¬ 
quency  of  mosaicism  in  neurofibromatosis  2  patients  with 
bilateral  vestibular  schwannomas.  J  Med  Genet  40:109-114 
Lee  DK,  Abbott  ML  (1969)  Familial  central  nervous  system 
neoplasia:  case  report  of  a  family  with  von  Recklinghausen’s 
neurofibromatosis.  Arch  Neurol  20:154-160 
Lopez-Correa  C,  Zucman-Rossi  J,  Brems  H,  Thomas  G,  Legius 
E  (2000)  NF2  gene  deletion  in  a  family  with  a  mild  pheno¬ 
type.  J  Med  Genet  37:75-77 

MacCollin  M,  Mautner  V-F  (1998)  The  diagnosis  and  manage¬ 
ment  of  neurofibromatosis  2  in  childhood.  Semin  Pediatr 
Neurol  5:243-252 

Mautner  V-F,  Baser  ME,  Kluwe  L  (1 996a)  Phenotypic  varia¬ 
bility  in  two  families  with  novel  splice-site  and  frameshift  NF2 
mutations.  Hum  Genet  98:203-206 
Mautner  V-F,  Baser  ME,  Thakkar  SD,  Feigen  UM,  Friedman 
JM,  Kluwe  L  (2002)  Vestibular  schwannoma  growth  in  pa¬ 
tients  with  neurofibromatosis  type  2:  a  longitudinal  study.  J 
Neurosurg  96:223-228 

Mautner  V-F,  Lindenau  M,  Baser  ME,  Hazim  W,  Tatagiba  M, 
Haase  W,  Samii  M,  Wais  R,  Pulst  S-M  (1 996b)  The  neuro¬ 
imaging  and  clinical  spectrum  of  neurofibromatosis  2.  Neu¬ 
rosurgery  38:880-886 

Mautner  V-F,  Lindenau  M,  Baser  ME,  Kluwe  L,  Gottschalk  J 
(1997)  Skin  abnormalities  in  neurofibromatosis  2.  Arch  Der¬ 
matol  133:1539-1543 

Mautner  V-F,  Tatagiba  M,  Guthoff  R,  Samii  M,  Pulst  S-M  (1993 ) 
Neurofibromatosis  in  the  pediatric  age  group.  Neurosurgery 
33:92-96 

Mautner  V-F,  Tatagiba  M,  Lindenau  M,  Funsterer  C,  Pulst 
S-M,  Baser  ME,  Kluwe  L,  Zanella  FE  (1995)  Spinal  tumors 
in  patients  with  neurofibromatosis  type  2:  MR  imaging  study 
of  frequency,  multiplicity,  and  variety.  AJR  Am  J  Roentgenol 
165:951-955  (erratum  166:1231) 

Merel  P,  Hoang-Xuan  K,  Sanson  M,  Bijlsma  E,  Rouleau  G, 
Laurent-Puig  P,  Pulst  S,  Baser  M,  Lenoir  G,  Sterkers  JM, 
Philippon  J,  Resche  F,  Mautner  V-F,  Fischer  G,  Hulsebos  T, 
Aurias  A,  Delattre  O,  Thomas  G  (1995)  Screening  for  germ- 


Baser  et  ah:  Genotype-Phenotype  Correlations  in  NF2 


000 


line  mutations  in  the  NF2  gene.  Genes  Chromosomes  Cancer 
12:117-127 

Moyhuddin  A,  Baser  ME,  Watson  C,  Purcell  S,  Ramsden  RT, 
Heiberg  A,  Wallace  AJ,  Evans  DGR  (2003)  Somatic  mo¬ 
saicism  in  neurofibromatosis  2:  prevalence  and  risk  of  dis¬ 
ease  transmission  to  offspring.  J  Med  Genet  40:459-463 
Nunes  F,  MacCollin  M  (2003)  Neurofibromatosis  2  in  the  pe¬ 
diatric  population.  J  Child  Neurol  18:718-724 
Otsuka  G,  Saito  K,  Nagatani  T,  Yoshida  J  (2003)  Age  at  symp¬ 
tom  onset  and  long-term  survival  in  patients  with  neurofi¬ 
bromatosis  type  2.  J  Neurosurg  99:480-483 
Parry  DM,  Eldridge  R,  Kaiser-Kupfer  MI,  Bouzas  E,  Pikus  A, 
Patronas  N  (1994)  Neurofibromatosis  2  (NF2):  clinical  char¬ 
acteristics  of  63  affected  individuals  and  clinical  evidence  for 
heterogeneity.  Am  J  Med  Genet  52:450-461 
Parry  DM,  MacCollin  MM,  Kaiser-Kupfer  MI,  Pulaski  K,  Nich¬ 
olson  HS,  Bolesta  M,  Eldridge  R,  Gusella  JF  (1996)  Germ¬ 
line  mutations  in  the  neurofibromatosis  2  gene:  correlations 
with  disease  severity  and  retinal  abnormalities.  Am  J  Hum 
Genet  59:529-539 

Patronas  NJ,  Courcoutsakis  N,  Bromley  CM,  Katzman  GL, 
MacCollin  M,  Parry  DM  (2001)  Intramedullary  and  spinal 
canal  tumors  in  patients  with  neurofibromatosis  2:  MR  im¬ 
aging  findings  and  correlation  with  genotype.  Radiology 
218:434-442 

Rouleau  GA,  Merel  P,  Lutchman  M,  Sanson  M,  Zucman  J, 
Marineau  C,  Hoang-Xuan  K,  Demczuk  S,  Desmaze  C,  Plou- 
gastel  B,  Pulst  SM,  Lenoir  G,  Bijlsma  E,  Fashold  R,  Duman- 
ski  J,  de  Jong  P,  Parry  D,  Eldridge  R,  Aurias  A,  Delattre  O, 
Thomas  G  (1993)  Alteration  in  a  new  gene  encoding  a  puta¬ 
tive  membrane-organizing  protein  causes  neurofibromatosis 
type  2.  Nature  363:515-521 

Ruttledge  MH,  Andermann  AA,  Phelan  CM,  Claudio  JO,  Han 
F-y,  Chretien  N,  Rangaratnam  S,  MacCollin  M,  Short  P, 
Parry  D,  Michels  V,  Riccardi  VM,  Weksberg  R,  Kitamura  K, 
Bradburn  JM,  Hall  BD,  Propping  P,  Rouleau  GA  (1996) 
Type  of  mutation  in  the  neurofibromatosis  type  2  gene  ( NF2 ) 


frequently  determines  severity  of  disease.  Am  J  Hum  Genet 
59:331-342 

Scoles  D,  Chen  M,  Pulst  S-M  (2002)  Effects  of  NF2  missense 
mutations  on  schwannomin  interactions.  Biochem  Biophys 
Res  Commun  290:366-374 

Scoles  DR,  Baser  ME,  Pulst  S-M  (1996)  A  missense  mutation 
in  the  neurofibromatosis  2  gene  occurs  in  patients  with  mild 
and  severe  phenotypes.  Neurology  47:544-546 

Scoles  DR,  Huynh  DP,  Morcos  PA,  Coulsell  ER,  Robinson 
NGG,  Tamanoi  F,  Pulst  S-M  (1998)  Neurofibromatosis  2 
tumour  suppressor  schwannomin  interacts  with  j8II-spectrin. 
Nat  Genet  18:354-359 

Trofatter  JA,  MacCollin  MM,  Rutter  JL,  Murrell  JR,  Duyao 
MP,  Parry  DM,  Eldridge  R,  Kley  N,  Menon  AG,  Pulaski  K, 
Haase  VH,  Ambrose  CM,  Munroe  D,  Bove  C,  Haines  JL, 
Martuza  RL,  MacDonald  ME,  Seizinger  BR,  Short  MP,  Buck¬ 
ler  AJ,  Gusella  JF  (1993)  A  novel  moesin-,  ezrin-,  radixin¬ 
like  gene  is  a  candidate  for  the  neurofibromatosis  2  tumor 
suppressor.  Cell  72:791-800  (erratum  75:826) 

Wallace  AJ,  Wu  C-L,  Elies  RG  (1999)  Meta-PCR:  a  novel 
method  for  creating  chimaeric  DNA  molecules  and  increas¬ 
ing  the  productivity  of  mutation  screening  techniques.  Ge¬ 
netic  Testing  3:173-183 

Wishart  JH  (1822)  Case  of  tumours  in  the  skull,  dura  mater, 
and  brain.  Edinburgh  Med  Surg  J  18:393-397 

Yau  SC,  Bobrow  M,  Mathew  CG,  Abbs  SJ  (1996)  Accurate 
diagnosis  of  carriers  of  deletions  and  duplications  in  Duch- 
enne/Becker  muscular  dystrophy  by  fluorescent  dosage  analy¬ 
sis.  J  Med  Genet  33:550-558 

Zhao  Y,  Kumar  RA,  Baser  ME,  Evans  DG,  Wallace  A,  Kluwe 
L,  Mautner  V-F,  Parry  DM,  Rouleau  GA,  Joe  H,  Friedman 
JM  (2002)  Intrafamilial  correlation  of  clinical  manifestations 
in  neurofibromatosis  2  (NF2).  Genet  Epidemiol  23:245-259 

Zucman-Rossi  J,  Legoix  P,  Sarkissian  HD,  Cheret  G,  Sor  F, 
Bernard!  A,  Cazes  L,  Giraud  S,  Ollagnon  E,  Lenoir  G,  Tho¬ 
mas  G  (1998)  NF2  gene  in  neurofibromatosis  type  2  pa¬ 
tients.  Hum  Mol  Genet  7:2095-2101 


1  of  5 


ELECTRONIC  LETTER 


Evaluation  of  genotype-phenotype  correlations  in 
neurofibromatosis  type  1 

B  Castle,  M  E  Baser,  S  M  Huson,  D  N  Cooper,  M  Upadhyaya 

J  Med  Genet  2003;40;el09  (http://www.jmedgenet.com/cgi/content/fuli/40/10/el09) 


Neurofibromatosis  type  1  (NF1)  is  an  autosomal 
dominant  disease  with  complete  penetrance  and 
extremely  variable  expression,  and  an  incidence  of 
approximately  1  in  4000  live  births.  Despite  the  high 
incidence  of  NF1,  neither  the  natural  history  nor  the  genetic 
epidemiology  of  the  disorder  are  well  understood.  People 
with  NF1  have  reduced  reproductive  fitness  and  life 
expectancy,1  but  the  cause  of  the  high  mutation  rate  at  the 
NF1  locus  is  unknown. 

The  most  prominent  clinical  hallmarks  of  NF1  are  cafe-au- 
lait  (CAL)  macules,  neurofibromas,  Iisch  nodules  of  the  iris, 
and  axillary  freckling.2  Other  clinical  manifestations  are 
abnormalities  of  the  cardiovascular,  gastrointestinal,  renal, 
and  endocrine  systems,  facial  and  body  disfigurement, 
cognitive  deficit,  and  malignancies  of  the  peripheral  nerve 
sheath  and  central  nervous  system.  About  25%  of  people  with 
NF1  develop  one  or  more  of  these  clinical  complications, 
which  together  cause  significant  morbidity  and  mortality.3 
The  tumours  that  occur  in  NF1  are  dermal  and  plexiform 
neurofibromas,  optic  gliomas,  malignant  peripheral  nerve 
sheath  tumours  (MPNSTs),  pheochromocytomas,  and  rhab¬ 
domyosarcomas.1  Children  with  NF1  have  an  increased  risk 
of  developing  myeloid  disease,  particularly  juvenile  chronic 
myeloid  leukaemia.  Some  30-40%  of  NF1  patients  develop 
plexiform  neurofibromas2"1  that  become  MPNSTs  in  5-10%  of 
cases,5-7  often  in  pre-existing  plexiform  neurofibromas. 

The  NF1  gene  on  chromosome  17qll,2  spans  more  than 
350  kb  of  genomic  DNA  and  contains  60  exons.*  The  8457  bp 
open  reading  frame  predicts  a  protein  of  2818  amino  acids 
with  an  estimated  mass  of  327  kDa.9 10  The  NF1  gene  product, 
neurofibromin,  contains  a  360  amino  acid  region  with 
homology  to  the  catalytic  domain  of  the  mammalian 
guanosine  triphosphatase  activating  protein.  Neurofibromin 
is  a  GTPase  activating  protein  for  members  of  the  p21ras 
(Ras)  protein  family.  Loss  of  neurofibromin  function  leads  to 
downstream  cell  growth  activation  because  neurofibromin 
negatively  regulates  Ras  output  by  accelerating  the  conver¬ 
sion  of  active  Ras-GTP  to  inactive  Ras -GDP.  Three  genes 
(EVI2A,EVI2B  and  OMGP)  are  embedded  within  intron  27  but 
are  transcribed  in  the  opposite  direction  to  the  NF1  gene.11  12 
However,  in  a  comparative  study  of  various  NF1  gross  gene 
deletions,  Viskochil8  concluded  that  haploinsufficiency  of  the 
three  embedded  genes  did  not  contribute  to  the  clinical 
features  of  NF1. 

Genetic  counselling  is  problematic  in  NF1  owing  to  the 
marked  inter-  and  intra-familial  variation  in  NF1  expression. 
The  exploration  of  genotype-phenotype  correlations  in  NF1  is 
still  in  its  infancy  because  of  the  extensive  mutational 
heterogeneity  of  the  gene,  and  because  large  scale  mutation 
screening  is  laborious  owing  to  the  size  and  complexity  of  the 
gene.  In  this  study,  we  characterised  constitutional  NF1  gene 
mutations  in  113  NF1  patients,  and  statistically  evaluated 
genotype-phenotype  correlations  in  110  of  these  patients.  We 
also  considered  atypical  cases  in  detail  because  allelic 
heterogeneity  is  a  possible  cause  of  some  NF1  phenotypes, 


Key  points 


|  Iqffons  in  1  '  ^  ;  ' 

•  The  clinical  abnormalities  were  cafe-au-lait  macules, 
axillaiy  freckling,  number  of  dermal  and  nodular 


particularly  Watson  syndrome  and  NF  Noonan  syndrome 
(NFNS). 13-15 


METHODS 

The  constitutional  NF1  gene  mutations  were  identified  in  one 
laboratory  using  a  battery  of  different  mutation  detection 
techniques:  single  strand  conformational  polymorphism 
analysis,  heteroduplex  analysis,  the  protein  truncation  test, 
dideoxy  fingerprinting,  denaturing  high  performance  liquid 
chromatography,  and  direct  DNA  sequencing.1^20  Clinical 
data  were  obtained  from  medical  records  of  examinations 
within  the  previous  2  years,  or  if  recent  medical  data  were 


Abbreviations:  CAL,  cafe-au-laif;  MPNSTs,  malignant  peripheral  nerve 
sheath  tumours;  NF,  neurofibromatosis;  NFNS,  neurofibromatosis 
Noonan  syndrome;  NIH,  (US)  National  Institutes  of  Health 
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unavailable,  the  patients  were  re-examined.  The  examining 
physician  was  blinded  to  the  patient's  mutation. 

There  were  113  NF1  patients  with  identified  NF1  gene 
mutations.  Three  of  these  patients  were  excluded  from  the 
statistical  analysis  of  genotype-phenotype  correlations:  two 
patients  who  did  not  meet  the  US  National  Institutes  of 
Health  (NIH)  diagnostic  criteria  for  NF121  and  a  single  patient 
with  an  identified  NF1  large  gene  deletion.  The  common 
clinical  abnormalities  were  coded  as:  lisch  nodules  (present/ 
absent),  number  of  CAL  macules  (0,  1-6,  >6),  axillary 
freckling  (present/absent),  number  of  dermal  and  nodular 
neurofibromas  (0,  1-10,  11-100,  101-500,  >500),  macro- 
cephaly  (present/absent),  short  stature  (present/absent),  the 
ratio  of  head  circumference  (cm)  to  height  (m)  (OFC/height 
ratio),  pectus  excavatum  (present/absent),  learning  difficul¬ 
ties  (present/absent),  plexiform  neurofibromas  (present/ 
absent),  scoliosis  (present/absent),  and  spinal  neurofibromas 
(present/absent).  Percentiles  for  the  OFC/height  ratio  were 
calculated  using  charts.22  25  The  prevalence  of  other  clinical 
abnormalities  (glioma,  sarcoma,  phaeochromocytoma,  gas¬ 
trointestinal  neurofibromas,  pulmonary  stenosis,  renal  artery 
stenosis,  delayed  puberty,  aqueduct  stenosis,  Watson  syn¬ 
drome,  Noonan  syndrome,  epilepsy,  bone  dysplasia)  was  too 
low  (5%  or  less)  for  meaningful  statistical  analysis. 

Logistic  regression  analysis  was  used  to  calculate  relative 
risks  (RR)  and  95%  confidence  intervals  (Cl)  for  binary 
outcomes.  Linear  regression  analysis  was  used  for  continuous 
outcomes.  Each  regression  model  had  the  covariates  of  the 
type  of  constitutional  NF1  mutation  and  age  at  examination 
(as  a  continuous  variable).  The  mutation  covariate  was  coded 
as  several  binary  variables  (indicators  of  missense  and  splice 
site  mutations,  each  compared  to  nonsense  or  frameshift 
mutations).  Age  at  examination  was  included  as  a  covariate 
because  the  penetrance  of  many  NF1  disease  features 
increases  with  age.24  25  A  logarithmic  transformation  was 
used  for  age  at  examination,  which  ranged  from  2  to  70  years. 
There  is  a  potential  bias  toward  lower  age  at  examination  in 
non-probands  relative  to  probands,  but  proband  status  did 
not  contribute  significantly  to  any  of  the  models  after 
including  age  at  examination  as  a  covariate,  so  probands 
and  non-probands  were  combined  in  subsequent  analyses. 
There  were  eight  families  with  two  affected  relatives  and  two 
families  with  three  affected  relatives. 

We  evaluated  genotype-phenotype  correlations  for  each 
common  clinical  abnormality  individually  and  for  three 
groups  of  disease  features:26  (a)  CAL  macules,  axillary 
freckling,  and  Lisch  nodules,  (b)  dermal,  nodular,  and 
plexiform  neurofibromas,  and  (c)  macrocephaly,  optic 
glioma,  and  other  neoplasms.  A  patient  was  categorised  as 
belonging  to  a  group  if  at  least  two  disease  features  in  the 
group  were  present. 

RESULTS 

The  identified  constitutional  NF1  gene  lesions  ranged  from 
single  base  pair  substitutions  to  gross  deletions,  but  the 
microlesions  appeared  to  be  uniformly  distributed  across 
the  gene  (http://www.uwcm.ac.uk/study/medicine/medical_ 
genetics/research/nfl/supplementary_data.html).  The  preva¬ 
lence  of  common  clinical  abnormalities  in  this  study  was 
generally  similar  to  those  of  other  published  studies  (table  1). 
Delayed  and  precocious  puberty  occur  in  NF1,27  28  but  the 
exact  prevalence  is  unknown.  In  this  study,  6%  of  males  and 
5%  of  females  had  delayed  puberty  at  16  years  of  age.  One 
patient  had  an  empty  sella  turcica  without  abnormal 
pituitary  function  and  a  second  patient  with  very  short 
stature  had  isolated  growth  hormone  deficiency  with  other¬ 
wise  normal  pituitary  function.  There  were  no  patients  with 
precocious  puberty. 


Genotype-phenotype  correlations 

Eighty  four  patients  had  ocular  examinations.  As  expected, 
the  relative  risk  (RR)  of  lisch  nodules  increased  with 
increasing  age  at  examination  (table  2).  When  compared 
with  people  with  nonsense  or  frameshift  mutations,  people 
with  missense  mutations  had  an  RR  of  lisch  nodules  that 
was  lower  than  unity  but  of  borderline  statistical  significance 
(RR  =  0.26,  95%  Cl  0.07  to  1.04,  p  =  0.06,  Cox  and  Snell 
R2  =  0.10).  There  were  no  significant  genotype-phenotype 
correlations  for  any  other  individual  abnormalities  or  for  any 
of  the  three  groups  of  disease  features  (data  not  shown). 

Atypical  cases 

There  were  two  patients  who  did  not  fulfil  the  NIH  diagnostic 
criteria  for  NF1  and  who  were  not  included  in  the  statistical 
analysis  of  genotype-phenotype  correlations.  One  was  a 
10  year  old  child  who,  in  common  with  his  mother,  had  CAL 
macules  but  no  other  stigmata  suggestive  of  NF1.  The  NF1 
mutation  in  the  family  was  a  missense  substitution 
(R1809L).  The  second  patient  was  a  27  year  old  woman  with 
no  family  history  of  NF1  and  no  CAL  spots,  freckling,  or  Lisch 
nodules.  She  had  a  small  number  (<  1 0)  of  cutaneous  and 
nodular  neurofibromas,  confirmed  on  biopsy.  In  addition,  she 
had  a  vestibular  schwannoma  and  a  schwannoma  of  the 
posterior  fossa  involving  the  vagus  nerve.  A  cervical  cord 
ependymoma  was  resected,  leaving  the  patient  paraplegic. 
Although  initially  considered  as  a  possible  case  of  NF  type  2, 
a  single  base  pair  insertion  in  exon  37  (6792  ins  A)  of  the  NF1 
gene  was  subsequently  identified  in  this  patient. 

There  were  three  patients  with  NFNS.  The  first  NFNS 
patient  was  a  6  year  old  boy  from  a  large  Ashkenazi  kindred 
with  NF1.  He  had  only  four  CAL  macules  on  examination  but 
had  been  developing  cutaneous  neurofibromas  since  the  age 
of  2  years,  and  had  intracranial  tumours  and  hydrocephalus 
secondary  to  aqueduct  stenosis,  which  had  required  shunt¬ 
ing.  There  were  no  Lisch  nodules  and  he  was  not 
macrocephalic.  He  had  facial  dysmorphism  suggestive  of 
Noonan  syndrome  but  had  an  inactivating  splice  site 
mutation  in  the  NF1  gene  (IVS  29+1  G-»C). 

The  second  NFNS  patient  was  a  20  year  old  man  who 
fulfilled  the  NIH  diagnostic  criteria  for  NF1,  and  had  a  pectus 
excavatum,  learning  difficulties,  and  mild  hydrocephalus.  He 
was  also  thought  to  have  aqueduct  stenosis  causing  hydro¬ 
cephalus,  which  did  not  require  shunting  at  the  time  of 
examination.  The  NF1  mutation  was  a  microdeletion  (2133 
del  CC).  There  was  no  evident  cardiac  abnormality  in  these 
two  patients. 

The  third  NFNS  patient  was  a  10  year  old  boy  who  was 
bom  prematurely  and  had  facial  dysmorphism,  widely  spaced 
nipples,  and  an  atrial-septal  defect.  His  NF1  mutation  was  a 
microinsertion  (4093  ins  TG). 

There  were  three  patients  with  Watson  syndrome.  Two  had 
mild  to  moderate  pulmonary  stenosis  that  was  documented 
on  echocardiography.  Neither  had  a  history  of  significant 
learning  difficulties,  but  they  did  not  complete  their  school¬ 
ing.  One  patient  had  a  novel  3  bp  deletion  (AAT)  within  exon 
17  of  the  NF1  gene  leading  to  a  loss  of  ATG  (methionine)  in 
exon  17,29  and  the  other  had  a  paternally  inherited  nonsense 
mutation  in  exon  29  (S1745X).  This  patient's  father  also  had 
mild  pulmonary  stenosis  with  stigmata  of  NF1.  The  third 
patient  was  a  49  year  old  man  with  significant  learning 
difficulties,  behavioural  problems  associated  with  epilepsy, 
and  pulmonary  stenosis.  He  had  an  intragenic  NF1  gene 
deletion  of  >20  bp. 

Another  atypical  patient  was  a  31  year  old  woman  who 
had  stigmata  of  NF1  almost  exclusively  in  the  cranial  region. 
There  were  four  CAL  macules  on  the  head  and  neck  only,  and 
multiple  large  histologically  confirmed  neurofibromas  on  the 
scalp  and  behind  the  ears.  There  was  also  macrocephaly  but 
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no  other  manifestations  of  NF1  in  the  rest  of  the  body.  In 
addition,  the  proband  was  diagnosed  with  Van  der  Woude 
syndrome  at  the  age  of  6  months,  with  a  cleft  palate  and  lip 
pits,  which  were  surgically  treated.  The  NFI  mutation  was  a 
microdeletion  in  exon  22  (3731  del  T). 

DISCUSSION 

NFI  patients  with  missense  mutations  had  an  RR  of  Lisch 
nodules  that  was  less  than  1,  and  of  borderline  statistical 
significance  when  compared  to  NF1  patients  with  nonsense 
or  frameshift  mutations  and  adjusted  for  age  at  examination. 
This  exploratory  finding  merits  investigation  in  other  NF1 
patient  populations,  although  it  is  difficult  to  evaluate  the 
pathogenicity  of  missense  mutations  without  functional 
studies.  In  this  study,  multiple  statistical  comparisons  were 
made  for  the  different  clinical  abnormalities  and  different 
mutation  types,  but  as  these  analyses  were  exploratory,  not 
confirmatory,  we  chose  to  test  associations  at  the  a  =  0.05 
level. 

From  the  clinical  viewpoint,  our  findings  are  notable  for 
two  reasons.  The  OFC/height  ratio  was  at  or  above  the  95th 
percentile  in  97%  of  cases,  but  the  prevalence  of  macro- 
cephaly  was  40%  and  the  prevalence  of  short  stature  was 
50%.  this  suggests  that  an  increase  in  the  OFC/height  ratio  is 
a  useful  clinical  indicator  of  NF1,  even  in  the  absence  of 
absolute  macrocephaly  or  short  stature.  The  finding  of 
delayed  puberty  in  6%  of  males  and  5%  of  females  at  16  years 
of  age  confirms  that  this  is  an  NF1  related  phenomenon. 

We  did  not  find  genotype-phenotype  correlations  for  other 
common  clinical  features  of  NF1,  similar  to  the  findings  of 
most  other  studies  of  genotype-phenotype  correlations  in 
NF1.4  30“33 

The  only  exception  is  the  correlation  of  large  NF1  gene 
deletions  (up  to  1.5  Mb  genomic  DNA)  with  an  earlier  age  of 
onset  of  cutaneous  neurofibromas,  learning  disability,  dys¬ 
morphic  features,  and  developmental  delay.18  34'38  However, 
not  all  patients  with  a  gross  NF1  gene  deletion  have  this  NF1 
phenotype,18  36  and  flanking  DNA  sequence  (or  the  lack  of  it) 
may  affect  the  phenotype.  Easton  et  ah™  modelled  the  genetic 
contributions  to  eight  traits  in  175  people  with  NF1  (CAL 


macules,  neurofibromas,  head  circumference,  plexiform 
neurofibromas,  optic  gliomas,  scoliosis,  epilepsy,  and  reme¬ 
dial  education).  There  were  no  significant  correlations 
between  the  traits,  indicating  that  the  phenotypic  expression 
of  NF1  was  largely  determined  by  trait  specific  loci  unlinked 
to  the  NF1  gene. 

We  were  not  able  to  evaluate  genotype-phenotype  correla¬ 
tions  for  some  of  the  most  clinically  significant  disease 
features  of  NF1,  such  as  the  various  types  of  cancer.  The  RR 
of  these  complications  is  very  high  compared  with  the 
general  population,  but  the  prevalence  in  NF1  is  relatively 
low,  and  there  were  too  few  patients  with  these  complica¬ 
tions  in  this  study  to  support  statistical  analysis. 

Identical  NF1  gene  mutations  can  occur  in  unrelated 
patients  with  very  different  phenotypes.11  19  38-41  Owing  to 
the  absence  of  strong  genotype-phenotype  correlations,30  NF1 
mutation  analysis  has  very  limited  predictive  utility  for 
specific  sequelae.  NF1  mutation  analysis  has  greater  diag¬ 
nostic  utility  because  mutation  detection  can  confirm  the 
aetiology  of  the  disease  in  the  relatively  few  families  and 
individuals  in  whom  the  clinical  phenotype  does  not  fulfil  the 
NIH  diagnostic  criteria.  A  provisional  diagnosis  of  NFI 
improves  clinical  management  by  triggering  the  initiation 
of  routine  blood  pressure  monitoring  in  patients  from  an 
early  age  and  alerting  medical  care  providers  to  the  possibility 
of  clinical  complications.  A  firm  diagnosis  allows  counselling 
regarding  mode  of  inheritance,  recurrence  risk,  and  poten¬ 
tially,  prenatal  diagnosis. 

Allelic  heterogeneity  is  a  possible  cause  of  some  of  the 
multiple  phenotypes  in  NFI,  particularly  Watson  syndrome 
(multiple  CAL  macules,  pulmonic  stenosis,  and  dull  intelli¬ 
gence),13  NFNS,15  and  familial  spinal  neurofibromatosis.42  To 
date,  however  there  is  no  evidence  to  support  allelic 
heterogeneity  as  a  cause  for  any  of  these  three  atypical 
phenotypes,  nor  did  we  find  such  evidence  in  the  present 
study,  as  detailed  below. 

Allelic  heterogeneity  is  not  likely  to  be  responsible  for 
Watson  syndrome  because  the  three  patients  in  the  present 
study  with  Watson  syndrome  had  NFI  mutations  of  differing 
types  and  locations.  Divergent  NFI  gene  mutations  have  been 


Table  2  Results  of  logistic  regression 
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identified:  an  80  kb  deletion  in  a  Watson  syndrome  patient/6 
and  a  42  bp  duplication  that  segregated  in  a  family  with  a 
phenotype  suggestive  of  NFNS.43 

About  12-13%  of  NF1  patients  who  are  specifically 
examined  for  Noonan  syndrome  have  a  clinical  phenotype 
that  is  similar  to  Noonan  syndrome/4  15  but  Carey,44  and 
Carey  and  Viskochil32  reviewed  these  patients,  and  concluded 
that  they  did  not  have  typical  Noonan  syndrome,  as  features 
of  Noonan  syndrome  without  cardiovascular  malformation 
have  been  observed  in  many  NF1  patients.  The  three  patients 
in  the  present  study  with  features  of  Noonan  syndrome  had 
NF1  mutations  of  differing  types  and  locations.  As  two  of 
these  patients  had  hydrocephalus  associated  with  features  of 
aqueduct  stenosis,  the  dysmorphic  features  of  Noonan 
syndrome  could  be  a  secondary  effect,  and  NFNS  may  not 
be  a  distinct  clinical  entity.  There  is  clinical  overlap  between 
Noonan  syndrome  and  NF1,  but  the  Noonan  syndrome  gene 
{PTPN11)  has  been  mapped  to  12q24.1  and  there  is  no 
suggestion  of  locus  heterogeneity  for  NF1. 

Spinal  neurofibromas  have  been  reported  in  up  to  38%  of 
NF1  patients,  but  only  5%  of  these  patients  have  spinal 
neurofibromatosis 42  All  patients  in  our  study  who  had 
familial  spinal  neurofibromatosis  also  fulfilled  the  NIH 
diagnostic  criteria  for  NF1,  and  had  Iisch  nodules  and 
dermal  neurofibromas,  as  has  been  previously  reported  42  45 
Ars  et  al42  described  a  frameshift  mutation  in  exon  46  (8042 
ins  A)  that  cosegregated  with  spinal  neurofibromatosis.  In 
another  study,  two  unrelated  NF1  patients  with  spinal 
neurofibromatosis  had  an  NF1  missense  mutation  in  exon 
33  (L2067P)  and  an  NF1  splice  site  mutation  (IVS31-5 
A-^G).46  A  recurrent  NF1  splice  site  mutation  (IVS19b-3 
C— >G)  was  identified  in  a  patient  with  spinal  neurofibroma¬ 
tosis.47  In  the  present  study,  patients  with  spinal  neurofi¬ 
bromatosis  also  had  different  types  of  mutations.  These 
findings  indicate  that  specific  NF1  gene  mutations  are  not 
associated  with  spinal  neurofibromatosis. 

One  NF1  patient  in  this  study  also  had  Van  der  Woude 
syndrome,  a  dominantly  inherited  disorder  with  locus 
heterogeneity.  The  majority  of  Van  der  Woude  patients  have 
linkage  to  the  lq32-41  locus,48  but  there  is  a  second  locus  on 
lp34.49  A  modifier  locus  exists  on  17pll.2/°  but  there  is  no 
evidence  to  suggest  an  association  with  the  NF1  locus  at 
17qll.2.  This  patient  may  be  segregating  two  distinct  and 
unrelated  autosomal  dominant  conditions. 

The  marked  clinical  variability  between  multiple  affected 
relatives  in  NF1  families  also  could  be  due  to  the  nature, 
timing,  or  location  of  the  "second  hit"  mutations  at  the 
NF1  locus  in  dermal  and  plexiform  neurofibromas/1'55 
MPSNTs,56-58  pheochromocytomas/9  pilocytic  astrocytomas,60 
and  juvenile  chronic  myelogenous  leukaemia  cells.61  Variable 
expressivity  has  also  been  observed  in  other  tumour  prone 
genetic  conditions  such  as  neurofibromatosis  2,  tuberous 
sclerosis  1  and  2,  and  familial  adenomatous  polyposis.62  63 

Somatic  mosaicism,  which  has  been  documented  in  over 
60  monogenic  disorders,64  is  another  potential  cause  of  inter¬ 
individual  phenotypic  variation.  Somatic  mosaicism  can 
occur  sufficiently  early  in  embryonic  life  to  involve  both 
somatic  and  germline  cells,  and  such  individuals  may  also  be 
at  risk  of  having  affected  children.65  Somatic  mosaicism  in 
NF1  is  thought  to  be  the  cause  of  segmental  neurofibroma¬ 
tosis.66  There  is  inter- individual  variation  in  mitotic  recombi¬ 
nation,67  and  the  genes  that  control  this  process  may  be  an 
important  cause  of  clinical  variability  within  NF1  families. 
Serra  et  al 68  reported  that  mitotic  recombination  was  the 
causal  mechanism  for  the  loss  of  heterozygosity  of  an  NF1 
constitutional  mutation. 

Our  study  gives  further  support  to  the  probable  importance 
of  modifying  loci,  as  suggested  by  Easton  et  aP°  This 
hypothesis  is  supported  by  the  observation  of  many 


investigators  that  identical  NF1  gene  mutations  often  give 
rise  to  very  different  phenotypes  in  unrelated  patients.  GDNF 
is  a  candidate  modifier  gene  in  NF1,69  and  preliminary  work 
in  a  mouse  model  of  NF1  has  also  suggested  the  existence  of 
modifying  genes.70  The  identification  of  modifying  genes  in 
humans  will  be  difficult  and  will  require  large  cohorts  of 
patients  identified  through  multicentre  collaborations,  but 
the  study  of  modifying  genes  may  eventually  allow  more 
accurate  prediction  of  specific  clinical  features  and  complica¬ 
tions  of  NF1  that  could  possibly  lead  to  new  therapeutic 
approaches. 
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Object  The  factors  that  determine  the  growth  rates  of  vestibular  schwannomas  (VSs)  in  patients  with  neurofibro¬ 
matosis  Type  2  (NF2)  are  unknown.  The  authors  undertook  this  study  to  determine  if  clinical  factors  or  type  of  con¬ 
stitutional  NF2  mutation  were  associated  with  VS  growth  rates  in  cases  of  NF2. 

Methods .  The  authors  reviewed  serial  gadolinium-enhanced  magnetic  resonance  (MR)  images  of  the  head  and 
full  spine  of  37  patients  with  sporadic  NF2  who  had  been  observed  over  periods  ranging  from  0.2  to  8  years  (medi¬ 
an  3.9  years)  at  a  specialized  referral  clinic  for  NF2.  A  box  model  was  used  to  calculate  VS  volumes  so  that  tumor 
growth  rates  could  be  estimated.  Temperature-gradient  gel  electrophoresis  was  used  to  screen  for  constitutional  NF2 
mutations.  The  VS  growth  rates  tended  to  decrease  with  increasing  patient  age  at  onset  of  signs  or  symptoms  (r2  = 
0.23,  p  =  0.003)  and  at  the  time  the  baseline  gadolinium-enhanced  MR  image  was  obtained  (r2  =  0.38,  p  <  0.001). 
The  authors  did  not  find  significant  associations  between  VS  growth  rates  and  the  number  of  non-VS  cerebral  or 
spinal  tumors  or  different  types  of  constitutional  NF2  mutations. 

Conclusions .  There  is  considerable  variability  in  growth  rates  of  VSs  in  patients  with  NF2,  but  they  tend  to  be 
higher  in  patients  who  are  younger  at  onset  of  signs  or  symptoms. 

Key  Words  •  natural  history  •  neurogenetics  •  neurofibromatosis  Type  2  • 


vestibular  schwannoma 

Neurofibromatosis  Type  2  is  an  autosomal-dominant 
disorder  that  occurs  with  a  symptomatic  prevalence 
of  1  in  210,000  persons.13  It  is  characterized  by  the 
development  of  benign  nervous  system  tumors  (VSs,  other 
cranial  and  spinal  tumors,  and  peripheral  nerve  tumors),  as 
well  as  ocular  and  cutaneous  abnormalities.12*24’27  Tumors 
associated  with  NF2  are  caused  by  inactivation  of  both  al¬ 
leles  of  the  NF2  tumor-suppressor  gene  by  mutation  or  al¬ 
lele  loss  (Table  l).29*34  The  NF2  protein  (termed  “merlin”  or 
“schwannomin”)  is  highly  homologous  to  proteins  in  the 
moesin-ezrin-radixin  family  that  link  glycophorin  mem¬ 
brane  proteins  to  the  actin  cytoskeleton.29*34  Merlin-binding 
proteins  have  been  identified,  but  the  mechanism  of  the  tu¬ 
mor-suppressor  function  of  merlin  is  unknown.7*16 
Unilateral  sporadic  VSs  are  also  caused  by  inactivation 
of  the  NF2  gene,  but  result  from  somatic  mutations  or  loss 
of  alleles.  The  prevalence  of  sporadic  VSs  is  approximate¬ 
ly  20  times  greater  than  that  of  NF2  and,  although  there 
is  a  finite  possibility  that  a  person  could  experience  inde¬ 
pendent  bilateral  sporadic  VSs,  patients  in  whom  unilater¬ 
al  VSs  develop  at  older  ages  and  who  do  not  meet  the  clini- 


Abbreviations  used  in  this  paper :  logi0  =  logarithm  to  the  base 
10;  MR  =  magnetic  resonance;  NF2  =  neurofibromatosis  Type  2; 
SEt  =  standard  error  of  the  regression  coefficient  b;  TDT  =  tumor 
doubling  time;  VS  =  vestibular  schwannoma. 


cal  diagnostic  criteria  for  NF217  are  highly  unlikely  to  have 
NF2.35  The  growth  rates  of  unilateral  sporadic  VSs  have 
been  studied,6-10*26*33  but  these  findings  may  not  be  gener- 
alizable  to  NF2-associated  VSs.  The  NF2-associated  VSs 
are  more  lobular  and  less  vascular  than  unilateral  sporadic 
VSs31  and  have  higher  proliferation  indices  than  the  sporad¬ 
ic  forms  of  these  tumors.2  Differences  in  the  proliferation 
potential  of  heritable  and  sporadic  VSs  may  be  due  to  dif¬ 
ferences  in  mutational  spectra  or  to  allelic  differences  in  tis¬ 
sues  surrounding  the  tumors,  which  in  NF2  patients  have 
only  one  functional  NF2  allele.2 

The  severity  of  NF2  is  assessed  by  examining  a  compos¬ 
ite  of  clinical  characteristics,  primarily  the  number  of  non- 
VS  cerebral  tumors  and  spinal  tumors,  and  the  patient’s  age 
at  onset  of  signs  or  symptoms.27  Generally,  nonsense  and 
ffameshift  constitutional  NF2  mutations  are  associated  with 
severe  disease,  splice-site  mutations  with  variable  disease 
severity,  and  missense  mutations  with  mild  disease. I4-18*19- 
28, 30  More  rapid  tumor  growth  is  an  obvious  possible  cause 
for  earlier  age  at  onset  and  more  severe  disease,  but  in  one 
small  longitudinal  study  the  researchers  found  higher  tumor 
growth  rates  in  older  patients  with  NF2,1  a  conclusion  that 
contradicts  the  impression  of  many  clinicians.  All  studies 
of  genotype-phenotype  correlations  in  NF2  have  been 
cross  sectional.  In  this  longitudinal  study,  we  evaluated  VS 
growth  rates  in  patients  with  NF2  and  examined  possible 
associations  with  clinical  and  molecular  factors. 
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TABLE  1 


List  of  definitions* 


Term 

Definition 

codon 

nucleotide  triplet  that  specifies  an  amino  acid  or  a 
signal  for  terminating  the  synthesis  of  a  polypep¬ 
tide;  a  stop  codon  terminates  the  synthesis  of  a 
polypeptide 

exon 

segment  of  gene  that  is  decoded  to  provide  a  mes¬ 
senger  RNA  or  mature  RNA  product 

frameshift  mutation 

mutation  that  alters  the  normal  translational  read¬ 
ing  frame  of  a  DNA  sequence 

in-frame  deletion 

deletion  of  a  multiple  of  three  nucleotides  that  puts 
the  reading  frame  back  in  phase 

intron 

noncoding  DNA  that  separates  neighboring  exons 
in  a  gene 

missense  mutation 

nucleotide  substitution  that  results  in  an  amino 
acid  change 

nonsense  mutation 

mutation  that  occurs  within  a  codon  and  changes  it 
into  a  stop  codon 

splice-site  mutation 

mutation  in  the  splice  junction  (exon-intron 
boundary)  that  destroys  signals  for  exon-intron 
splicing! 

translational  reading 

mechanism  that  moves  a  ribosome  three  nucleo¬ 

frame 

tides  at  a  time  during  translation 

*  Based  on  definitions  provided  by  Strachan  and  Read, 
t  A  splice  junction  may  be  a  splice  acceptor  site  (junction  between  the 
end  of  an  intron  and  the  start  of  the  next  exon)  or  a  splice  donor  site  (junc¬ 
tion  between  the  end  of  an  exon  and  the  start  of  the  downstream  intron). 

Clinical  Material  and  Methods 

Patient  Population 

Eighty-eight  patients  with  NF2  were  followed  up  at  the 
Neurology  Department,  Klinikum  Nord  Ochsenzoll,  Ham¬ 
burg,  Germany,  a  specialized  referral  clinic  forNF2,  during 
the  period  from  1988  through  1998.  Eleven  cases  of  inher¬ 
ited  NF2  were  excluded  because  the  known  family  history 
of  NF2  may  have  affected  the  patient’s  age  at  which  the 
diagnosis  of  NF2  was  made  or  the  age  at  apparent  clinical 
onset  of  NF2.  Three  patients  who  were  known  to  have  the 
somatic  mosaic  form  of  NF2  were  also  excluded.  Of  the  re¬ 
maining  74  patients  with  sporadic  NF2,  longitudinal  gado¬ 
linium-enhanced  MR  imaging  was  available  in  37  patients 
and  unavailable  in  the  other  37.  These  two  groups  had  sim¬ 
ilar  distributions  of  measured  clinical  characteristics  and 
types  of  constitutional  NF2  mutations.  All  patients  or  their 
parents  provided  informed  consent,  and  all  patients  met  the 
clinical  diagnostic  criteria  for  definite  NF217  or  had  identi¬ 
fied  constitutional  NF2  mutations. 

Mutation  Analysis 

Deoxyribonucleic  acid  was  extracted  from  peripheral 
lymphocytes  or  directly  from  blood  by  using  a  blood  poly¬ 
merase  chain  reaction  kit  according  to  the  manufacturer’s 
instructions  (QIAmp  Blood  Kit;  Qiagen,  Hilden,  Germa¬ 
ny).  Exons  1  through  15  of  the  NF2  coding  region  were 
scanned  with  temperature-gradient  gel  electrophoresis.19 
No  mutations  have  been  reported  to  date  in  exons  16  or  17 
(M  Baser,  unpublished  data). 

Brain  and  Spine  Imaging  and  Image  Analysis 

We  evaluated  VS  growth  rates  by  retrospectively  ana¬ 
lyzing  full  head  and  spine  gadolinium-enhanced  MR  im¬ 


TABLE2 


Neuroradiological  abnormalities  in  37  patients 
with  new  NF2  mutations 


Abnormality 

No.  of  Patients  (%) 

cranial  nerve  tumors* 

olfactory 

1(3) 

optic  (schwannoma  or  meningioma) 

5(14) 

oculomotor 

4(11) 

trochlear 

1(3) 

trigeminal 

unilat 

16  (43) 

bilat 

6(16) 

vestibulocochlear 

unilat 

2(5) 

bilat 

35  (94) 

hypoglossal 

3(8) 

any  nonvestibulocochlear  cranial 

26  (70) 

nerve  tumor 

intracranial  meningioma 

23  (62) 

spinal  tumor  (all  locations)! 

35  (94) 

cervical 

30  (81) 

thoracic 

28  (76) 

lumbar 

31  (84) 

*  There  were  no  tumors  of  the  sixth,  seventh,  ninth,  10th,  or  1 1th  cranial 
nerves. 

f  Locations  of  spinal  tumors  are  not  mutually  exclusive.  A  patient  could 
be  counted  as  harboring  cervical,  thoracic,  and  lumbar  spinal  tumors. 


ages  that  were  obtained  initially  (the  baseline  measure¬ 
ment)  and  subsequently,  usually  annually.  A  1.5-tesla  MR 
magnet  (Magnetom  63  SP;  Siemens,  Erlangen,  Germany) 
was  used.  The  same  MR  magnet  and  software  were  used 
throughout  the  study  period  (1988-1998).  Sagittal  and  co¬ 
ronal  4-mm  slices  were  examined.  Both  T,-  and  ^-weight¬ 
ed  sagittal  images  were  obtained,  and  Trweighted  coronal 
and  sagittal  images  were  obtained  before  and  after  adminis¬ 
tration  of  gadolinium.  Cervical  spine  images  were  obtained 
using  a  Helmholtz  neck  coil,  and  thoracic  and  lumbar  spine 
images  by  using  a  circularly  polarized  surface  coil.  Tumors 
less  than  4  mm  in  diameter  were  excluded  to  avoid  nonspe¬ 
cific  findings.  All  gadolinium-enhanced  MR  images  were 
assessed  independently  by  two  radiologists,  and  the  mean 
of  each  pair  of  measurements  in  the  two  sets  of  readings 
was  used.  Volumes  of  VSs  were  calculated  on  the  basis  of 
maximum  extensions  in  anteroposterior,  mediolateral,  and 
superoinferior  planes  from  two  dimensions  on  film.  We  did 
not  analyze  computerized  tomography  scans  or  MR  images 
obtained  without  gadolinium. 

Statistical  Analysis 

Preoperative  VS  volume  measurements  were  available 
for  64  VSs  in  37  patients;  two  patients  had  unilateral  VSs 
and  eight  other  patients  had  undergone  VS  excision  before 
two  sequential  gadolinium-enhanced  MR  images  were  ob¬ 
tained.  Growth  rates  were  expressed  as  TDT  in  years  to 
account  for  possible  differences  in  growth  rates  related  to 
the  baseline  VS  volume.  Average  growth  rate  (presented  in 
cmVyear)  was  calculated  using  linear  regression  over  the 
entire  period  of  observation.  The  TDT  was  calculated  as 
follows:  (2  X  baseline  VS  volume)/average  VS  growth  rate 
per  year. 

For  eight  VSs,  the  average  tumor  growth  rate  presented 
in  cubic  centimeters  per  year  was  0  and  the  TDT  could  not 
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Age  at  onset  of  signs  or  symptoms  (years) 

Fig.  1.  Scattergram  comparison  of  VS  growth  rate  and  patient 
age  at  onset  of  signs  or  symptoms  (f2  =  0.23,  p  -  0.003).  Tumor 
doubling  times  of  70  years  are  arbitrarily  assigned  values. 


Age  at  baseline  VS  measurement  (years) 

Fig.  2.  Scattergram  comparison  of  VS  growth  rate  and  patient 
age  at  time  of  baseline  VS  measurement  (r2  =  0.38,  p  <  0.001).  Tu¬ 
mor  doubling  times  of  70  years  are  arbitrarily  assigned  values. 


be  calculated.  We  assigned  an  arbitrary  TDT  of  70  years  to 
these  tumors  and  to  three  VSs  with  negative  growth  rates. 
The  highest  TDT  for  a  single  VS  in  the  study  was  53  years, 
and  only  three  VSs  demonstrated  TDTs  greater  than  30 
years;  the  results  were  very  similar  when  an  arbitrary  TDT 
of  50  years  instead  of  70  years  was  assigned.  In  patients 
with  bilateral  VSs,  growth  rates  and  volumes  from  both 
sides  were  averaged  in  some  analyses  because  there  were 
no  significant  differences  between  the  relationship  of  left- 
and  right-sided  growth  rates  and  volumes  to  covariates.  A 
log10  transformation  was  used  for  the  TDTs  because  the 
doubling  times  ranged  from  0.03  to  70  years  and  were  non- 
normally  distributed. 

The  covariates  studied  were  sex,  age  at  onset  of  signs 
or  symptoms  of  NL2,  age  when  the  baseline  gadolinium- 
enhanced  MR  image  was  obtained,  number  of  intracrani¬ 
al  meningiomas,  number  of  spinal  tumors,  and  type  of  con¬ 
stitutional  NF2  mutation.  All  covariates  were  considered 
fixed.  The  number  of  NF2  tumors  increases  with  patient 
age,  but  in  no  patients  did  new  non-VS  tumors  develop  that 
grew  to  4  mm  or  larger  in  diameter  (the  inclusion  criteri¬ 
on)  during  the  follow-up  period.  The  two-tailed  Fisher  ex¬ 
act  test,  Pearson  correlation  coefficient,  and  linear  regres¬ 
sion  were  used  for  tests  of  association.  The  Mann-Whitney 
U-test  was  used  to  test  between-group  differences  in  TDTs. 
Probability  values  lower  than  0.05  were  considered  to  be 
statistically  significant. 

Results 

Seventeen  (46%)  of  the  37  patients  were  female.  The 
median  age  at  onset  of  signs  or  symptoms  of  NF2  was  12 
years.  The  age  at  onset  was  highly  correlated  with  the  age 
at  which  the  gadolinium-enhanced  MR  image  was  obtained 
(r2  =  0.59,  p  <  0.001),  which  averaged  13  years  later.  The 
median  length  of  observation  with  enhanced  imaging  was 
3.9  years  (range  0.2-8  years),  with  a  median  of  five  serial 
imaging  sessions.  Spinal  tumors  were  found  in  35  patients 
(94%);  in  five  patients  (14%)  the  tumors  were  located  in 


the  thoracic  or  lumbar  spine,  but  not  in  the  cervical  spine 
(Table  2).  Intracranial  meningiomas  were  present  in  62%  of 
patients.  Trigeminal  nerve  tumors  were  the  most  common 
non-VS  cranial  nerve  tumor  (59%  of  patients). 

As  expected,  the  shapes  of  the  VS  growth  curves  were 
highly  variable  and  included  linear,  sigmoid,  logarithmic, 
and  irregular  curves.  Using  a  linear  model  to  calculate 
TDTs,  VS  growth  rates  decreased  (that  is,  log10  TDTs  in¬ 
creased)  with  increasing  patient  age  at  onset  of  signs  or 
symptoms  and  at  the  baseline  imaging  session  (Figs.  1  and 
2).  The  linear  regression  equations  relating  tumor  growth 
rate  to  patient  age  were  the  following:  mean  TDT  in  logI0 
years  =  0.26  4-  0.03  (patient  age  at  onset  of  signs  or  symp¬ 
toms  in  years;  SE,,  =  0.01  years,  r2  =  0.23,  p  =  0.003);  and 
mean  TDT  in  logI0  years  =  —0.24  4-  0.04  (age  at  baseline 
imaging  in  years;  SEb  =  0.01  years,  r2  =  0.38,  p  ,  0.001). 

Twenty-seven  patients  with  48  VSs  underwent  four  or 
more  volume  measurements,  and  sigmoid  curves  could  be 
fit  to  22  VSs  (46%)  in  17  of  these  patients.  A  sigmoid 
growth  curve  was  not  more  likely  to  occur  with  longer  peri¬ 
ods  of  observation. 

Individual  VS  growth  rates  were  highly  variable,  ranging 
from  no  growth  to  a  TDT  of  0.03  years.  The  patient  in  Case 
62,  3  years  old  at  the  time  the  baseline  gadolinium-en¬ 
hanced  MR  images  were  obtained,  was  the  youngest  pa¬ 
tient  in  the  study  and  his  tumor  displayed  unusually  high 
growth  rates.  His  left-  and  right-sided  VSs  grew  at  TDTs  of 
0.03  and  0.05  years,  respectively,  causing  death  3  months 
after  the  baseline  measurements  were  obtained.  These  VS 
growth  rates  were  10  times  greater  than  any  others  in  the 
study,  but  the  associations  of  tumor  growth  rates  with  pa¬ 
tient  age  were  very  similar  when  this  patient  was  excluded 
from  the  analysis. 

Left-  and  right-sided  VS  growth  rates,  but  not  baseline 
VS  volumes,  were  highly  correlated  (Figs.  3  and  4).  The 
linear  regression  equations  that  describe  the  associations 
between  right-sided  VS  and  left-sided  VS  growth  rates  and 
volumes  were  the  following:  right-sided  VS  TDT  in  log10 
years  =  0.24  4-  0.66  (left-sided  VS  TDT  in  log10  years; 
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Left  VS  growth  rate  (TDT,  log  10  years) 

Fig.  3.  Scattergram  comparison  of  growth  rates  of  left-  and 
right-sided  VSs  (r2  =  0.48,  p  <  0.001).  Tumor  doubling  times  of  70 
years  are  arbitrarily  assigned  values. 


SEt =  0.14,  r2  =  0.48,  p  <  0.001);  and  right-sided  VS  base¬ 
line  volume  in  cubic  centimeters  =  5.3  +  0.16  (left-sided 
VS  baseline  volume  in  cubic  centimeters;  Sl\  =  0.1,  r2  = 
0.09,  p  =  0.11). 

The  VS  growth  rates  and  baseline  VS  volumes  did  not 
vary  significantly  with  the  number  of  intracranial  meningi¬ 
omas  or  spinal  tumors. 

Constitutional  NF2  mutations  were  identified  in  23  pa¬ 
tients  (62%),  and  included  eight  frameshift  mutations,  sev¬ 
en  nonsense  mutations,  four  splice  donor  site  mutations, 
three  splice  acceptor  site  mutations,  and  one  in-frame  dele¬ 
tion  (Table  3).  The  median  VS  growth  rate  of  patients  with 
splice-site  mutations  was  2.6-fold  lower  than  that  of  pa¬ 
tients  with  nonsense  or  frameshift  mutations,  but  the  vari¬ 
ability  of  growth  rates  was  very  high  and  this  difference 
was  not  statistically  significant  (p  =  1;  Table  4). 


Fig.  4.  Scattergram  of  left-  and  right-sided  baseline  VS  volumes 
(r2  =  0.09,  p  =  0.1 1).  cc  =  cubic  centimeter. 


TABLE  3 


Identified  constitutional  NF2  mutations  in  23  patients  with  NF2* 


Case  Altered 
No.  Exon 

Altered 

Codon 

Nucleotide  Change 

Consequence 

51 

1 

12 

35-36  del  GC 

frameshift 

68 

2 

56 

166  del  C 

frameshift 

35 

2 

57 

169  C  — »  T 

nonsense 

5 

3 

111 

331  C^T 

nonsense 

18 

3 

118 

352-354  del  TTC 

in-frame  deletion 

27 

5 

agTA  — >  atTA 

splice  acceptor  site 

44 

5 

agTA  — »  aaTA 

splice  acceptor  site 

55 

6 

196 

586  C  — »  T 

nonsense 

40 

7 

+5  g  c 

splice  donor  site 

57 

7 

agGG  ggGG 

splice  acceptor  site 

28 

8 

AGgt  — >  GGgt 

splice  donor  site 

24 

8 

239 

717  del  G 

frameshift 

3 

8 

262 

784  C  ^  T 

nonsense 

4 

8 

262 

784  C^T 

nonsense 

17 

10 

303 

908-909  del  AC 

frameshift 

15 

11 

349 

1047-1053  del 
CTGAACGC 

frameshift 

45 

11 

360 

1079  G  ->  T 

nonsense 

46 

12 

410 

1229  del  A 

frameshift 

14 

12 

428 

1282  C  — »  T 

nonsense 

12 

14 

AAgt  — »  AAct 

splice  donor  site 

52 

14 

490 

1469  del  C 

frameshift 

8 

15 

AGgt  — »  ATgt 

splice  donor  site 

42 

15 

527 

1580  del  A 

frameshift 

*  Some  mutations  have  been  previously  reported  in  studies  by  Kluwe 
and  colleagues  (1996  and  1998).  Abbreviations:  del  =  delete;  +  =  gab  of. 


Discussion 

The  main  findings  in  this  study  included  the  clear  ten¬ 
dency  for  VS  growth  rates  to  be  higher  among  younger  pa¬ 
tients  with  NF2  and  the  high  correlation  of  left-  and  right¬ 
sided  VS  growth  rates  in  patients  with  bilateral  tumors.  The 
presentation  of  NF2  in  children  is  markedly  different  from 
that  in  adults  because  the  presenting  sign  or  symptom  in 
children  is  usually  unrelated  to  the  VSs.11-23-25  Individuals 
who  are  diagnosed  with  NF2  at  younger  ages  have  an  in¬ 
creased  risk  of  mortality.3  Our  finding  that  VS  growth  rates 
are  higher  in  younger  patients  contradicts  the  finding  of 
Abaza  and  colleagues,1  but  is  consistent  with  those  of  Ba¬ 
ser,  et  al.,4  perhaps  because  our  study  and  that  conducted  by 
Baser,  et  al.,  were  based  on  VSs  that  had  never  been  treat¬ 
ed  surgically,  whereas  Abaza  and  colleagues  included  mea¬ 
surements  on  postoperative  VSs.  This  could  lead  to  a  bias 
toward  tumors  in  older  patients  and/or  tumors  displaying 
more  rapid  growth.4 

The  high  correlation  of  VS  growth  rates,  but  not  vol¬ 
umes,  in  patients  with  bilateral  VSs  indicates  that  the  left- 
and  right-sided  VSs  have  grown  at  similar  rates  for  differ¬ 
ent  lengths  of  time.  This  interpretation  is  consistent  with 
stochastic  inactivation  of  the  second  NF2  allele  in  the  two 
Schwann  cells  that  give  rise  to  the  left-  and  right-sided  VSs. 
Further  support  for  the  involvement  of  stochastic  processes 
comes  from  a  study  of  monozygotic  twins  with  NF2,  who 
have  the  same  general  disease  severity  but  differ  with  re¬ 
spect  to  specific  disease  features  and  timing  of  their  occur¬ 
rence.5  The  high  variability  in  VS  growth  rates  from  year 
to  year  in  the  same  individual  also  is  consistent  with  the  in- 
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fluence  of  stochastic  factors  or  complex  interactions  on  VS 
growth  rates. 

Although  we  did  not  observe  an  association  between 
type  of  constitutional  NF2  mutation  and  VS  growth  rates, 
this  study  population  is  too  small  to  draw  conclusions  about 
whether  such  correlations  exist.  Cross-sectional  genotype- 
phenotype  correlation  studies  have  led  to  consistent  find¬ 
ings  of  associations  between  clinical  indices  of  NF2  disease 
severity  and  type  of  constitutional  NF2  mutation.14*18’19*28*30 
Genes  other  than  NF2  may  affect  NF2  disease  severity,9 
schwannoma  tumorigenesis,8*22  and  meningioma  tumori- 
genesis.21  Such  modifying  genes  might  also  influence  VS 
growth  rates. 

A  possible  source  of  bias  in  this  study  may  arise  from  the 
fact  that  prevalent  cases  were  selected  and  the  patients  were 
often  given  the  diagnosis  before  referral  to  our  clinic.  Pa¬ 
tients  with  faster-growing  tumors  would  come  to  surgery 
sooner  and  may  have  been  excluded  from  this  study.  Al¬ 
though  we  are  unable  to  assess  this  bias,  there  is  no  evi¬ 
dence  that  the  patients  with  NF2  who  were  included  in  this 
study  differed  from  those  who  were  referred  but  lacked 
longitudinal  measurements,  because  the  distributions  of 
clinical  characteristics  and  types  of  constitutional  NF2  mu¬ 
tations  were  similar  in  both  groups. 

Another  possible  source  of  bias  may  arise  because  box 
models  were  used  to  estimate  VS  volumes,  rather  than 
precise  computer  volumetries;  however,  volume  measure¬ 
ments  based  on  three  dimensions  provide  a  relatively  good 
approximation.15*20  Spherical  models  have  been  used  to  es¬ 
timate  VS  volumes  when  measurements  are  available  in 
only  two  dimensions,10  but  VSs  can  vary  widely  in  shape; 
many  of  the  VSs  in  this  study  did  not  have  similar  measure¬ 
ments  in  all  three  dimensions,  thus  indicating  nonspherical 
shapes. 

Logarithmic  models  have  been  used  to  calculate  TDTs 
for  VSs  and  other  tumor  types,  particularly  when  only  two 
longitudinal  measurements  have  been  available,10  but  we 
could  fit  only  a  sigmoid  growth  curve  to  approximately  half 
of  the  VSs  for  which  we  had  four  or  more  volume  mea¬ 
surements.  It  has  been  suggested  that  linear  models  of  VS 
growth  are  less  appropriate  as  the  length  of  observation  in¬ 
creases,20  but  we  did  not  find  such  an  association  in  this 
study. 


Conclusions 

Growth  rates  of  VSs  associated  with  NF2  tend  to  be 
higher  in  younger  patients  than  in  older  ones.  Larger 
and  longer  longitudinal  studies  are  needed  to  elucidate 
more  fully  the  clinical  and  genetic  predictors  of  VS  growth 
in  NF2. 
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TABLE  4 

Indices  of  disease  severity  by  constitutional  NF2  mutation  type  in 
37  patients  with  sporadic  NF2 


Percentage 
of  Patients  W/ 
Other  Tumors 


Type  of 
Constitutional 
NF2  Mutation 

No. 

of 

Patients 

Median  TDT  in 
Years  (range)* 

Patient  Age 
at  Onset  of 
Signs/Symp¬ 
toms  (yrs)f 

Intra¬ 

cranial 

Menin¬ 

giomas 

Spinal 

Tumors 

nonsense  or 

15 

5.4  (0.7-70) 

13.3  ±  1.5 

67 

93 

ffameshift 

splice-site 

7 

14.0  (0.3-40.9) 

14.9  ±  4.7 

57 

100 

in-frame 

1 

38.3 

7.0 

100 

100 

deletion 

unidentified 

14 

6.6  (0.04-70) 

17.9  ±4.1 

57 

93 

*  Difference  in  median  TDT  between  patients  with  nonsense  or  frame- 
shift  mutations  and  those  with  splice-site  mutations  (p  =  1,  Mann-Whitney 
U-test). 

t  Values  are  shown  as  means  ±  standard  error  of  mean  where  applicable. 
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Cafe-au-lait  spots  and  neurofibromas  are 
defining  features  of  neurofibromatosis  1 
(NF1),  but  they  vary  greatly  in  number,  size, 
and  clinical  importance  from  patient  to 
patient.  The  cause  of  this  variability  is 
unknown.  We  tested  the  hypotheses  that  de¬ 
velopment  of  these  lesions  is  influenced  by 
local  or  familial  factors.  The  presence  or 
absence  of  cafe-au-lait  spots,  cutaneous  neu¬ 
rofibromas,  and  diffuse  plexiform  neurofi¬ 
bromas  was  recorded  for  each  of  10  divisions 
of  the  body  surface  in  547  NF1  patients, 
including  117  affected  individuals  in  52 
families.  We  used  stratified  Mantel-Haenszel 
tests  to  look  for  local  associations  between 
the  presence  of  diffuse  plexiform  neurofibro¬ 
mas,  cutaneous  neurofibromas,  and  cafe-au- 
lait  spots  in  individual  body  segments  of  NF1 
patients.  We  used  a  random  effects  model  to 
obtain  intrafamilial  correlation  coefficients 
for  the  age-adjusted  number  of  body  divi¬ 
sions  affected  with  each  of  the  three  lesions. 
No  significant  association  was  observed 
between  the  occurrence  of  cutaneous  and 
diffuse  plexiform  neurofibromas,  between 
cafe-au-lait  spots  and  cutaneous  neurofibro¬ 
mas,  or  between  cafe-au-lait  spots  and  plexi¬ 
form  neurofibromas  in  the  same  body 
segment.  The  correlation  among  relatives 
in  the  number  of  body  segments  affected 
with  cafe-au-lait  spots  was  0.45  (95%  con- 
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fidence  interval  [Cl]  =  0.18-0.71),  with  cuta¬ 
neous  neurofibromas,  0.37  (95%  Cl  =  0.15- 
0.55),  and  with  plexiform  neurofibromas, 
0.35  (95%  Cl  =  0.15-0.57).  We  conclude  that 
the  development  of  cafe-au-lait  spots,  cuta¬ 
neous  neurofibromas,  and  plexiform  neuro¬ 
fibromas  are  spatially  independent  in  NF1 
patients  but  that  the  development  of  all  three 
lesions  is  influenced  by  familial  factors. 

©  2003  Wiley-Liss,  Inc. 
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INTRODUCTION 

Neurofibromatosis  1  (NF1)  is  an  autosomal  dominant 
condition  characterized  by  extremely  variable  expres¬ 
sivity.  Cafe-au-lait  spots  and  neurofibromas  are  the 
defining  features.  Neurofibromas  are  complex  benign 
tumors  arising  in  the  fascicles  of  peripheral  nerves 
[Korf,  1999].  Histologically,  a  local  increase  in  endo- 
neurial  matrix  of  the  fascicle  is  accompanied  by  a 
thickened  perineurium,  increased  size  and  number  of 
Schwann  cells  [Harkin  and  Reed,  1969;  Woodruff,  1999], 
and  increased  numbers  of  mast  cells  and  fibroblasts 
[Giorno  et  al.,  1989].  Cutaneous  neurofibromas  are 
confined  to  a  single  fascicle  within  a  nerve,  while  diffuse 
plexiform  neurofibromas  involve  multiple  fascicles 
[Burger  and  Scheithauer,  1994]. 

Cutaneous  neurdfibromas  begin  to  appear  in  mid¬ 
childhood  and  eventually  develop  in  almost  all  NF1 
patients  [Friedman  and  Riccardi,  1999;  DeBella  et  al., 
2000],  Cutaneous  neurofibromas  tend  to  increase  in 
number  and  size  with  age.  Some  adults  with  NF1  have 
hundreds  or  thousands  of  these  lesions;  other  NF1 
patients  develop  only  a  few  cutaneous  neurofibromas 
throughout  life. 

Diffuse  plexiform  neurofibromas  are  almost  always,  if 
not  always,  congenital  [Friedman  and  Riccardi,  1999]. 
Many  are  apparent  on  surface  examination,  although 
they  often  extend  into  deeper  tissues.  Some  diffuse 
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plexiform  neurofibromas  involve  only  deeper  tissues 
and  are  not  apparent  on  physical  examination.  Plexi¬ 
form  neurofibromas  tend  to  be  larger  than  cutaneous 
neurofibromas,  sometimes  involving  an  entire  limb  or 
other  part  of  the  body.  Plexiform  neurofibromas  may 
give  rise  to  malignant  peripheral  nerve  sheath  tumors, 
but  discrete  cutaneous  neurofibromas  rarely,  if  ever,  do. 

Cafe-au-lait  spots  are  pigmented  macules.  Histologi¬ 
cally,  they  contain  melanocytes  with  abnormally  large 
pigment  particles  [Fitzpatrick,  1981].  Cafe-au-lait  spots 
may  be  present  at  birth,  and  by  1  year  of  age  almost  all 
children  with  NF1  have  six  or  more  of  these  lesions 
[Friedman  and  Riccardi,  1999;  DeBella  et  al. ,  2000] .  The 
number  and  location  of  cafe-au-lait  spots  and  neurofi¬ 
bromas  are  highly  variable,  even  among  NF 1  patients  of 
similar  age.  The  cause  of  this  variability  is  unknown. 
Here  we  test  the  hypotheses  that  the  development  of 
these  lesions  is  influenced  by  local  or  familial  factors. 

SUBJECTS  AND  METHODS 
Subjects 

Five  hundred  and  forty-seven  NF1  patients,  including 
117  affected  individuals  in  52  families,  who  had 
information  recorded  on  spatial  distribution  of  skin 
lesions  were  available  in  the  NF  Institute  Database 
[Riccardi,  1992].  All  of  these  patients  were  evaluated  by 
Dr.  Vincent  Riccardi,  and  all  meet  the  NIH  diagnostic 
criteria  for  NF1  [National  Institutes  of  Health  Con¬ 
sensus  Development  Conference,  1988;  Gutmann  et  al., 

1997]  .  For  each  patient,  the  presence  of  one  or  more  cafe- 
au-lait  spots,  one  or  more  cutaneous  neurofibromas,  and 
one  or  more  diffuse  plexiform  neurofibromas  was 
recorded  for  each  of  the  ten  divisions  of  the  body  surface 
shown  in  Figure  1. 

Cafe-au-lait  spots  were  defined  as  light  brown 
macules  with  a  greatest  diameter  of  at  least  5  mm.  In 
addition  to  being  larger  than  freckles,  cafe-au-lait  spots 
usually  do  not  occur  in  clusters,  whereas  freckles  almost 
always  do  in  people  with  NF1.  Discrete  dermal  neuro¬ 
fibromas  were  classified  as  either  cutaneous  or  sub¬ 
cutaneous.  Cutaneous  neurofibromas  lie  in  the  skin 
and  move  with  it;  the  skin  can  be  moved  over  the  top 
of  subcutaneous  neurofibromas,  which  He  deeper.  Sub¬ 
cutaneous  neurofibromas  were  not  analysed  in  this 
study  because  they  were  too  infrequent  for  statistical 
evaluation. 

Analysis  of  Local  Effect 

We  used  two-layered  Mantel-Haenszel  tests  [SPSS, 

1998]  to  look  for  local  associations  between  the  presence 
of  diffuse  plexiform  neurofibromas  and  cutaneous 
neurofibromas  in  individual  body  segments  of  each 
NF1  patient.  We  stratified  simultaneously  by  the  body 
segment  being  considered  and  by  the  number  of  other 
body  segments  with  one  or  more  cutaneous  neurofibro¬ 
mas  (a  categorical  variable  with  range  0-9).  This 
stratification  was  used  to  adjust  for  the  fact  that  an 
NF1  patient  who  has  a  larger  total  number  of  body 
segments  with  one  or  more  neurofibromas  is  more  likely 
to  have  at  least  one  neurofibroma  in  any  particular 


Fig.  1.  Body  segment  scheme  used  by  Neurofibromatosis  Institute 
Database. 


segment  than  an  NF1  patient  who  has  fewer  total  body 
segments  affected.  This  approach,  which  takes  all 
factors  that  affect  the  likelihood  of  developing  neuro¬ 
fibromas  into  account,  was  used  instead  of  stratifying 
by  age  alone  because  other  clinical  factors  also  affect 
the  likelihood  that  an  NF1  patient  will  have  neuro¬ 
fibromas  [Szudek  et  al.,  2000a;  Szudek  et  al.,  2003], 

Confidence  intervals  for  the  summary  odds  ratio  (OR) 
were  obtained  using  a  jackknife  based  on  20  different 
subgroups — a  number  that  is  sufficiently  large  to 
produce  a  stable  estimate  [Miller,  1974].  Homogeneity 
was  assessed  using  the  Breslow-Day  test  [SPSS,  1998]. 
Local  associations  between  cafe-au-lait  spots  and  cuta¬ 
neous  neurofibromas  and  between  cafe-au-lait  spots  and 
plexiform  neurofibromas  were  analyzed  in  the  same 
manner. 

Skin  Surface  Area 

The  body  divisions  used  in  this  study  cover  varying 
amounts  of  skin  surface  area,  so  we  checked  for  an 
association  between  surface  area  and  the  presence  of  one 
or  more  cutaneous  neurofibromas  in  a  segment.  Using 
logistic  regression,  we  set  the  segment  area  as  the  in¬ 
dependent  variable  and  the  presence  or  absence  of 
cutaneous  neurofibromas  as  the  dependent  variable. 
We  tested  in  a  similar  manner  for  associations  between 
surface  area  and  the  presence  of  diffuse  plexiform 
neurofibromas  or  cafe-au-lait  spots  in  a  segment.  Since 
the  median  age  of  our  patients  was  13  years,  we 


approximated  the  surface  area  of  the  body  segments  by 
using  standard  percentages  for  10-14  year-old  indivi¬ 
duals  [McManus  and  Pruitt,  1996].  The  proportions  of 
total  surface  area  assigned  to  each  body  segment  were: 
head  =  11%,  neck  =  2%,  right  upper  torso  =  12%,  left 
upper  torso  =  12%,  right  lower  torso  =  4%,  left  lower 
torso  =  4%,  right  arm  =  9.5%,  left  arm  =  9.5%,  right 
leg=  18%,  and  left  leg=  18%. 

Total  Number  of  Neurofibromas 

In  addition  to  data  on  whether  each  body  segment  was 
affected  by  one  or  more  cutaneous  neurofibromas,  com¬ 
plete  counts  of  cutaneous  neurofibromas  were  available 
for  44  of  the  patients.  The  total  number  of  neurofibromas 
in  these  patients  ranged  from  none  to  several  hundred 
and  appeared  to  increase  logarithmically  with  the 
number  of  affected  segments.  We  used  linear  regression 
[SPSS,  1998]  to  test  the  relationship  between  log- 
transformed  counts  of  the  total  number  of  cutaneous 
neurofibromas  in  an  individual  and  the  number  of  body 
segments  that  included  one  or  more  cutaneous  neurofi¬ 
bromas.  Counts  of  the  total  number  of  cafe-au-lait  spots 
were  not  made,  and  few  subjects  had  more  than  one 
plexiform  neurofibroma,  so  these  variables  were  not 
analyzed  in  this  manner. 

Familial  Analysis 

For  the  familial  analysis,  we  stratified  subjects  into 
5-year  age  intervals,  calculated  the  deciles  for  the  total 
number  of  segments  affected  with  cutaneous  neurofi¬ 
bromas  in  each  stratum,  and  ranked  each  subject  by 
decile  for  the  stratum  in  which  he  or  she  lay.  We  then 
used  random  effects  models  to  obtain  maximum  like¬ 
lihood  estimates  and  confidence  intervals  for  intrafami- 
lial  correlation  coefficients  for  rank  [Spjotvoll,  1967; 
Donner  et  al.,  1989].  Cafe-au-lait  spots  and  plexiform 
neurofibromas  were  analyzed  in  the  same  manner. 

RESULTS 

We  studied  the  distribution  of  cafe-au-lait  spots, 
cutaneous  neurofibromas,  and  diffuse  plexiform  neuro- 
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fibromas  in  10  segments  of  the  body  surface  (Fig.  1)  in 
each  of  547  patients  with  NF1.  Two  hundred  eighty-one 
(51.4%)  of  the  subjects  were  female,  and  266  (48.6%) 
were  male.  Four  hundred  twenty-six  (77.9%)  were 
white,  67  (12.2%)  were  Hispanic,  44  (8.0%)  were  black 
and  10  (1.8%)  were  of  other  or  mixed  origin.  The  mean 
age  was  17.5  years,  and  the  median  age  was  13  years. 
The  youngest  patient  included  was  a  newborn  infant, 
and  the  oldest  was  64.3  years  of  age. 

Lesion  Frequency  by  Body  Segment 

Table  I  shows  the  frequency  of  these  lesions  in  each 
of  the  10  body  segments.  Three  hundred  thirty-seven 
patients  had  one  or  more  cutaneous  neurofibromas,  and 
216  patients  had  at  least  one  plexiform  neurofibroma. 
Cutaneous  and  plexiform  neurofibromas  occurred  with 
similar  frequencies  in  all  ten  body  segments.  Cafe-au- 
lait  spots  were  observed  in  almost  all  patients  and  had 
similar  frequencies  in  all  segments  except  the  head, 
where  these  lesions  were  less  frequent. 

No  Associations  Between  Lesion  Types 
Within  Individual  Body  Segments 

Table  II  shows  the  ten  body  segments  examined  and 
the  odds  ratios  for  associations  of  each  pair  of  lesions 
for  each  segment.  No  association  was  observed  between 
the  occurrence  of  cutaneous  and  diffuse  plexiform 
neurofibromas  in  the  same  body  segment.  The  summary 
odds  ratio  (OR)  was  1.20  (95%  confidence  interval 
[Cl]  =  0.81-1.79).  There  was  no  evidence  for  hetero¬ 
geneity  across  body  segments  (P  =  0.37). 

Similarly,  there  was  no  association  between  the  pre¬ 
sence  of  cafe-au-lait  spots  and  either  cutaneous  or  dif¬ 
fuse  plexiform  neurofibromas  within  a  single  body 
segment.  There  was  significant  (P  =  0.03)  heterogeneity 
in  the  occurrence  of  cutaneous  neurofibromas  and  cafe- 
au-lait  spots,  with  a  positive  association  seen  in  the  neck 
(OR  =  2.94;  95%  Cl  =  1.20-7.20).  No  evidence  of  hetero¬ 
geneity  across  body  segments  was  found  for  the 
occurrence  of  plexiform  neurofibromas  and  cafe-au-lait 
spots  (P=0.52). 


TABLE  I.  Number  and  Percentage  of  547  NF1  Patients  Who  Have  One  or  More  Cutaneous 
Neurofibromas,  Diffuse  Plexiform  Neurofibromas,  or  Cafe-au-lait  Spots  in  Each  of 

10  Body  Segments 

Cutaneous  Plexiform  Cafe-au-lait 

neurofibromas  neurofibromas  spots 


Segment 

Total 

(%> 

Total 

<%) 

Total 

(%) 

1.  Head 

179 

(33) 

47 

(8) 

101 

(18) 

2.  Neck 

168 

(31) 

29 

(5) 

397 

(73) 

3.  Right  upper  torso 

259 

(47) 

32 

(6) 

532 

(97) 

4.  Left  upper  torso 

258 

(47) 

21 

(4) 

531 

(97) 

5,  Right  lower  torso 

285 

(52) 

55 

(10) 

537 

(98) 

6.  Left  lower  torso 

287 

(52) 

41 

(7) 

533 

(97) 

7.  Right  arm 

206 

(38) 

21 

(4) 

514 

(94) 

8.  Left  arm 

208 

(38) 

19 

(3) 

511 

(93) 

9.  Right  leg 

219 

(40) 

54 

(10) 

527 

(96) 

10.  Left  leg 

220 

(40) 

45 

(8) 

525 

(96) 

Total 

337 

(62) 

216 

(39) 

543 

(99) 
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TABLE  H  Associations  Between  Cutaneous  Neurofibromas,  Diffuse  Plexiform  Neurofibromas,  and  Cafe-au-lait  Spots  by  Body 
_ '  _  Segment  in  547  NF1  Patients 


Cutaneous  and  plexiform 
neurofibromas 

Cutaneous  neurofibromas 
and  cafe-au-lait  spots 

Cafe-au-lait  spots  and  plexiform 
neurofibromas 

Segment 

OR 

(95%  CD 

OR 

(95%  CD 

OR 

(95%  CD 

1.  Head 

0.95 

(0.34-2.68) 

1.34 

(0.67-2.67) 

1.26 

(0.60-2.65) 

2.  Neck 

2.39 

(0.51-11.20) 

2.59 

(1.23-5.47) 

2.42 

(0.71-8.24) 

3.  Right  upper  torso 

0.83 

(0.23-3.02) 

0.26 

(0.01-10.34) 

4.  Left  upper  torso 

0.39 

(0.06-2.49) 

0.12 

(0.01-7.07) 

1.29 

(0.02-83.37) 

5.  Right  lower  torso 

0.85 

(0.32-2.24) 

0.98 

(0.01-84.41) 

-  6.  Left  lower  torso 

0.91 

(0.35-2.36) 

1.13 

(0.19-6.94) 

0.06 

(0.01-0.99) 

7.  Right  arm 

1.17 

(0.09-14.43) 

1.91 

(0.29-12.67) 

8.  Left  arm 

1.01 

(0.18-5.60) 

0.91 

(0.18-4.65) 

0.22 

(0.02-1.97) 

9.  Right  leg 

3.91 

(1.02-15.06) 

0.20 

(0.04-1.15) 

0.35 

(0.05-2.34) 

10.  Left  leg 

3.60 

(0.99-13.08) 

1.10 

(0.24-5.00) 

2.70 

(0.17-44.12) 

Summary 

1.20 

(0.81-1.79) 

1.36 

(0.91-2.03) 

1.25 

(0.74-2.12) 

Odds  ratios  (OR)  could  not  be  calculated  for  the  association  of  cafe-au-lait  spots  and  plexiform  neurofibibromas  in  the  right  upper  torso,  right  lower  torso,  or 
right  arm  because  there  were  no  patients  who  had  plexiform  neurofibromas  but  did  not  have  cafe-au-lait  spots  in  these  segments. 


Log-Linear  Relationship  Between 
Segment  Size  and  Number  of 
Cutaneous  Neurofibromas 

The  number  of  body  segments  affected  with  one  or 
more  cutaneous  neurofibromas  was  strongly  correlated 
with  the  total  number  of  cutaneous  neurofibromas  in  44 
NF1  patients  in  whom  both  total  counts  and  data  on 
the  number  of  affected  body  segments  were  available 
(r  — 0.95,  P<  0.001).  The  relationship  is  log  linear;  the 
regression  equation  is: 

log(total  number  of  neurofibromas  +  1) 

=  0.23  x  (number  of  segments  affected)  +  0.014 

We  observed  no  significant  association  between  the 
relative  size  of  the  body  surface  area  in  a  segment  and 
the  presence  of  one  or  more  cutaneous  neurofibromas 
(P  =  0. 18)  or  of  a  diffuse  plexiform  neurofibroma 
(P  —  0.23).  In  contrast,  an  association  was  found  be¬ 
tween  the  presence  of  one  or  more  cafe-au-lait  spot  in  a 
body  segment  and  its  surface  area  expressed  as  a 
percentage  of  the  body’s  total  (P<  0.001,  OR  =  1.030, 
95%  Cl  =  1,015-1.046). 

All  Three  Lesions  Are  Correlated  Among 
Relatives  With  NF1 

We  estimated  intrafamilial  correlations  in  the  age- 
adjusted  number  of  body  segments  that  included  one  or 
more  cafe-au-lait  spots,  one  or  more  cutaneous  neurofi¬ 
bromas,  or  one  or  more  plexiform  neurofibromas  in  117 
affected  members  of  52  families.  We  found  significant 
intrafamilial  correlations  for  the  number  of  body 
segments  affected  by  each  of  these  clinical  features. 
The  intrafamilial  correlation  coefficient  for  the  number 
of  body  segments  affected  with  cafe-au-lait  spots  was 
0.45  (95%  Cl  =  0.18-0.71).  The  correlation  among  re¬ 
latives  with  NF1  for  the  number  of  body  segments 
affected  with  cutaneous  neurofibromas  was  0.37  (95% 
Cl  =  0.15-0.55).  The  correlation  coefficient  among  re¬ 


latives  for  the  number  of  body  segments  affected  with 
plexiform  neurofibromas  was  0.35  (95%  Cl  =  0.15-0.57). 


DISCUSSION 

Lesions  in  Body  Segments 
of  Individual  Patients 

We  found  a  very  high  correlation  (r  =  0.95)  between 
the  number  of  body  segments  in  which  one  or  more 
cutaneous  neurofibromas  was  present  and  the  total 
number  of  cutaneous  neurofibromas  in  44  patients  in 
whom  counts  were  available.  Counting  all  cutaneous 
neurofibromas  in  an  adult  with  NF1  is  very  tedious,  and 
our  observation  provides  the  basis  for  using  the  number 
of  body  segments  affected  by  cutaneous  neurofibromas 
as  a  surrogate  for  the  total  number  of  cutaneous 
neurofibromas  in  clinical  trials  and  studies  of  geno- 
type-phenotype  correlations.  It  seems  likely  that  a 
similar  relationship  exists  between  the  number  of  body 
segments  affected  with  cafe-au-lait  spots  or  plexiform 
neurofibromas  and  the  severity  of  each  of  these  disease 
features,  but  we  did  not  have  information  on  total  counts 
of  these  lesions  available  to  demonstrate  this. 

Since  almost  all,  if  not  all,  diffuse  plexiform  neurofi¬ 
bromas  are  of  congenital  origin  [Friedman  and  Riccardi, 
1999],  we  wanted  to  find  out  if  they  influence  the 
subsequent  development  of  cutaneous  neurofibromas. 
Our  findings  indicate  that  the  occurrence  of  cutaneous 
neurofibromas  in  NF1  patients  is  not  strongly  influ¬ 
enced  by  the  local  presence  of  a  diffuse  plexiform 
neurofibroma.  In  fact,  we  found  that  all  three  of  the 
lesions  studied  (cafe-au-lait  spots,  cutaneous  neurofi¬ 
bromas,  and  plexiform  neurofibromas)  occurred  inde¬ 
pendently  of  each  another  in  almost  all  of  the  body 
segments  analyzed  (Table  II).  We  found  a  significant 
association  between  cafe-au-lait  spots  and  cutaneous 
neurofibromas  only  in  the  neck.  One  possible  reason  the 
neck  might  be  affected  by  both  lesions  is  recurrent  minor 
trauma  to  the  skin  associated  with  flexion,  extension, 
and  rotation  of  the  head  [Riccardi,  1990]. 


The  independent  occurrence  of  cutaneous  neurofibro¬ 
mas,  plexiform  neurofibromas,  and  cafe-au-lait  spots 
in  most  body  regions  is  consistent  with  the  possibility 
that  somewhat  different  pathogenic  mechanisms  are 
involved  in  development  of  these  three  lesions.  Families 
with  NFI  mutations  who  develop  cafe-au-lait  spots 
but  no  tumors  have  been  reported  [Abeliovich  et  al., 
1995],  an  observation  that  is  consistent  with  different 
pathogenic  factors  being  involved  in  the  development 
of  cafe-au-lait  spots  and  neurofibromas.  In  mouse 
models  of  NF1,  cutaneous  and  plexiform  neuro¬ 
fibromas  can  develop  by  different  pathogenic  pathways 
[Feigenbaum  et  al.,  1996;  Cichowski  et  al.,  1999;  Vogel 
etal.,  1999]. 

Familial  Correlations 

The  intrafamilial  correlations  we  observed  for  cuta¬ 
neous  neurofibromas  and  cafe-au-lait  spots  in  NF1 
patients  are  consistent  with  the  findings  of  a  previous 
study  [Easton  et  al.,  1993].  The  number  of  familial 
patients  and  the  prevalences  of  all  three  lesions  were 
similar  in  these  two  studies.  Our  study  found  a  similar 
correlation  for  cafe-au-lait  spots  but  higher  correlation 
coefficients  for  cutaneous  neurofibromas  than  Easton 
and  his  associates  did.  We  also  found  a  significant 
familial  correlation  for  plexiform  neurofibromas.  Easton 
et  al.  [1993]  analyzed  this  feature  as  a  discrete  (present/ 
absent)  trait  and  found  no  familial  association. 

We  have  also  studied  the  familiality  of  cafe-au-lait 
spots,  cutaneous  neurofibromas,  and  plexiform  neuro¬ 
fibromas  as  discrete  traits  in  an  independent  series  of 
NF1  patients  [Szudek  et  al. ,  2000b;  Szudek  et  al. ,  2002] . 
The  results  of  that  study  are  consistent  with  the 
current  one  and  with  the  study  of  Easton  and  associates 
[1993]  despite  differences  in  design  and  methodology: 
We  again  found  strong  intrafamilial  correlations  for 
cafe-au-lait  spots  (r  =  0.43,  95%  Cl  =  0.29-0.57)  and 
cutaneous  neurofibromas  (r  =  0.49,  95%  Cl  =  0.33- 
0.65).  Like  Easton  et  al.  [1993]  we  did  not  find  a 
correlation  for  the  occurrence  of  plexiform  neurofibro¬ 
mas  considered  as  a  discrete  trait  when  all  relatives 
were  considered,  but  we  did  find  a  significant  sib-sib 
correlation  for  the  occurrence  of  this  clinical  feature 
(r  =  0.18,  95%  Cl  =  0.04-0.32).  These  observations 
provide  further  evidence  for  the  importance  of  familial 
factors  in  the  development  of  cafe-au-lait  spots  and 
neurofibromas  in  people  with  NF1. 

The  genetic  basis  for  these  familial  associations  has 
not  been  determined,  but  contributing  factors  may 
include  effects  of  the  mutant  NF1  allele  itself,  effects 
of  the  normal  NF1  allele,  or  modifying  effects  of  other 
loci.  The  number  of  individuals  included  in  our  study 
was  too  small  to  permit  comparison  of  the  strength  of 
the  correlation  among  relatives  of  various  classes, 
which  would  be  necessary  to  distinguish  these  factors 
statistically.  The  moderate  magnitudes  of  the  intra¬ 
familial  correlation  coefficients  we  found  show  that 
familial  factors  alone  are  insufficient  to  predict  the 
degree  to  which  a  patient  will  be  affected  by  these 
lesions. 
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Analysis  of  Intrafamilial  Phenotypic 
Variation  in  Neurofibromatosis  1  (NF1) 
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The  relationship  of  genetic  factors  to  variable  expressivity  in  neurofibromatosis  1 
(NF1)  is  poorly  understood.  We  examined  familial  aggregation  of  NF1  features 
among  different  classes  of  affected  relatives.  Clinical  information  was  obtained 
from  the  National  NF  Foundation  International  Database  on  904  affected 
individuals  in  373  families  with  2  or  more  members  with  NF1.  We  used 
multivariate  probit  regression  to  measure  the  associations  between  various  classes 
of  relatives  for  each  of  10  clinical  features  of  NF1,  while  simultaneously  adjusting 
for  covariates  including  related  features,  age,  and  gender.  Two  distinct  patterns 
were  observed  when  we  compared  associations  between  first-  and  second-degree 
relatives,  sibs,  and  parent-child  pairs:  Lisch  nodules  and  cafe-au-lait  spots  had 
greater  associations  between  first-degree  relatives  than  between  second-degree 
relatives,  while  subcutaneous  neurofibromas,  plexiform  neurofibromas,  cafe-au- 
lait  spots,  and  intertriginous  freckling  had  greater  associations  between  sibs  than 
between  parents  and  children.  In  addition,  Lisch  nodules,  subcutaneous 
neurofibromas,  and  cutaneous  neurofibromas  had  greater  associations  between 
affected  fathers  and  children  than  between  affected  mothers  and  children.  These 
familial  patterns  suggest  that  unlinked  modifying  genes  and  the  normal  NF1 
allele  may  both  be  involved  in  the  development  of  particular  clinical  features  of 
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NF1,  but  that  the  relative  contributions  vary  for  different  features.  Genet. 
Epidemiol.  23:150-164,  2002.  ©  2002  Wiley-Liss,  Inc. 
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INTRODUCTION 

Neurofibromatosis  1  (NF1)  is  an  autosomal-dominant  disease  that  affects  about 
1/3,500  people  [Friedman,  1999].  NF1  can  affect  the  skin,  skeleton,  and  nervous 
system,  and  is  characterized  by  highly  variable  expressivity  [Friedman  et  al.,  1999], 
Many  disease  features  are  progressive,  but  the  rate  of  progression  and  the  occurrence 
of  serious  manifestations  vary  greatly  from  one  patient  to  another  [Friedman  and 
Riccardi,  1999].  This  variability  and  the  confounding  effect  of  age  have  hindered 
efforts  to  characterize  the  relationship  of  genetic  factors  at  the  NFJ  locus  or  other 
loci  to  disease  variability. 

More  than  400  different  constitutional  NF1  mutations  have  been  reported 
[Fahsold  et  al.,  2000;  Korf,  1999;  Messiaen  et  al.,  2000],  In  general,  little  evidence  has 
been  found  of  allele-phenotype  correlations  in  NF1,  although  a  more  or  less 
consistent  phenotype  occurs  in  association  with  deletions  involving  the  entire  NF1 
gene  [Dorschner  et  al.,  2000;  Tonsgard  et  al.,  1997].  Similar  clinical  features  were 
observed  among  affected  members  of  a  few  families  with  the  NF1  variants  Watson 
syndrome  [Allanson  et  al.,  1991],  familial  cafe-au-lait  spots  [Abeliovich  et  al.,  1995], 
or  familial  spinal  neurofibromas  [Ars  et  al.,  1998;  Poyhonen  et  al.,  1997;  Pulst  et  al., 
1991],  This  observation  is  consistent  with  an  allele-phenotype  correlation,  but  no 
consistent  kind  of  NF1  mutation  has  been  found  in  families  with  these  or  other 
phenotypic  variants.  Affected  members  of  a  single  family  with  typical  NF1  often 
have  quite  different  disease  phenotypes,  despite  sharing  an  identical  mutant  NF1 
allele.  Clearly,  variation  in  the  mutant  NFJ  allele  itself  does  not  account  for  all  of  the 
variability  seen  in  most  disease  features. 

Easton  et  al.  [1993]  studied  the  expressivity  of  NF1  in  175  affected  members  of 
48  families,  and  found  statistically  significant  correlations  for  the  number  of  cafe-au- 
lait  spots,  the  number  of  dermal  discrete  neurofibromas,  and  head  circumference 
among  affected  relatives.  Comparison  of  the  strength  of  these  correlations  in 
relatives  of  different  classes  provided  evidence  for  modifying  genes  influencing  the 
number  of  cafe-au-lait  spots. 

We  showed  that  several  statistically  significant  associations  exist  between  the 
occurrence  of  individual  clinical  features  in  3,067  unrelated  probands  with  NF1 
[Szudek  et  al.,  2000b;  Szudek  et  al.,  unpublished  findings].  We  also  found  significant 
associations  in  the  occurrence  of  Lisch  nodules,  optic  glioma,  learning  disability, 
macrocephaly,  and  short  stature  in  affected  parent-child  pairs  [Szudek  et  al.,  2000b], 
but  made  no  attempt  to  adjust  for  the  nonindependence  of  multiple  relative-pairs 
from  the  same  family  or  for  associations  among  clinical  features  in  individuals  in  this 
preliminary  study.  We  now  extend  our  analysis  to  measure  correlations  of  NF1 
features  among  relatives  of  various  classes,  using  methods  that  take  other  clinical 
features,  gender,  and  age  into  account,  and  adjust  for  the  nonindependence  of 
affected  relatives.  By  comparing  the  correlations  observed,  we  provide  evidence  that 
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genetic  sources  of  variation  are  generally  important  in  NF1  and  vary  for  different 
clinical  features. 

SUBJECTS  AND  METHODS 
Subjects 

All  patients  in  this  study  met  the  NIH  diagnostic  criteria  for  NF1  [Gutmann 
et  al.,  1997;  NIH,  1988].  Data  were  obtained  from  the  National  NF  Foundation 
International  Database  (NFDB)  [Friedman  et  al.,  1993].  The  NFDB  contains 
extensive  demographic,  clinical,  and  genetic  data  on  NF1  patients  from  more  than  20 
participating  clinical  centers  in  North  America,  Europe,  Japan,  and  Australia.  All 
information  is  recorded  using  a  standard  format  and  consistent  definitions  of  clinical 
features.  The  greatest  strength  of  the  NFDB  is  its  large  size.  The  limitations  include 
the  fact  that  data  are  contributed  by  staff  members  of  specialized  neurofibromatosis 
clinics,  which  probably  produces  an  ascertainment  bias. 

At  the  time  of  this  study,  the  available  dataset  included  373  families  with  two  or 
more  affected  members,  for  a  total  of  904  individuals  with  NF1.  Three  hundred 
forty-six  of  these  were  nuclear  families  that  included  either  an  affected  parent  and 
one  or  more  affected  children  or  two  or  more  affected  sibs.  Twenty-seven  families 
were  more  extended,  including  a  total  of  74  second-degree  affected  relative-pairs. 
The  family  sizes  ranged  from  2-7  affected  members.  There  were  272  families  with  2, 
65  with  3, 24  with  4,  6  with  5,  3  with  6,  and  3  with  7  affected  members  included  in  the 
study. 

For  analysis  of  familiality,  we  selected  10  clinical  features  of  NF1:  cafe-au-lait 
spots,  intertriginous  freckling,  Lisch  nodules,  cutaneous  neurofibromas,  subcuta¬ 
neous  neurofibromas,  plexiform  neurofibromas,  seizures,  scoliosis,  optic  glioma,  and 
neoplasms  other  than  neurofibromas  or  optic  gliomas  (“other  neoplasms”).  Most  of 
these  features  were  identified  by  physical  examination,  and  all  were  treated  as  binary 
variables.  Cafe-au-lait  spots  were  coded  as  “present”  if  the  subject  had  six  or  more 
spots.  Cutaneous  or  subcutaneous  neurofibromas  were  coded  as  “present”  if  the 
subject  had  two  or  more  lesions  of  the  same  type.  Plexiform  neurofibroma  was  coded 
as  “present”  if  the  subject  had  one  or  more  lesions.  Lisch  nodules  were  diagnosed  or 
excluded  by  slit-lamp  examination.  The  presence  or  absence  of  optic  glioma  was 
determined  by  cranial  MRI  or  CT  examination.  Only  patients  with  definite  presence 
or  absence  of  a  feature  were  considered  in  models  involving  that  feature.  The 
complete  dataset  used  in  this  study  is  available  from  the  authors  on  request. 

Statistical  Methods 

Familial  correlations  in  each  class  of  relatives  were  estimated  for  clinical 
features  measured  as  binary  (presence/absence)  variables  by  means  of  a  multivariate 
probit  model  [Ashford  and  Sowden,  1970;  Joe,  1995;  Mendell  and  Elston,  1974].  For 
a  particular  binary  response  variable,  the  covariates  used  for  adjustment  were  chosen 
on  the  basis  of  a  univariate  probit  or  logistic  stepwise  regression,  ignoring  familial 
dependence.  These  covariates  were  then  used  in  the  multivariate  probit  model.  We 
included  age  as  a  covariate  in  all  analyses,  because  many  NF1  features  have  a  higher 
prevalence  in  older  patients  [DeBella  et  al.,  2000],  We  showed  previously  that  clinical 
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features  do  not  occur  independently  in  NF1  patients,  even  after  adjusting  for  the 
effect  of  age  [Szudek  et  al.,  2000b;  Szudek  et  al.,  unpublished  findings].  Therefore,  we 
also  included  the  binary  variables  representing  the  presence  or  absence  of  other 
associated  features,  as  well  as  gender,  as  covariates  to  minimize  confounding. 
In  addition,  we  considered  stature  and  head  circumference  as  covariates  after 
standardizing  the  measurements  for  each  patient  to  age-  and  gender-specific 
population  norms  [Szudek  et  al.,  2000a]. 

For  the  multivariate  probit  model,  the  binary  response  vector  is  (Yj,. .  Yfc)  for  a 
family  of  size  k,  Yj  is  1  if  a  latent  variable  Zj  <  /?0  +  /Tx,  where  /J0  is  an  intercept  and  /? 
is  a  vector  of  regression  coefficients  for  the  covariate  vector  x.  (Zu. . .,  Zk)  have  a  joint 
multivariate  normal  distribution  with  zero  mean  vector  and  correlation  matrix  R, 
where  the  (latent)  correlation  for  a  given  pair  depends  on  the  relation  type  of  the  pair. 

The  estimates  of  the  regression  coefficients  and  latent  correlation  coefficients, 
together  with  an  estimated  covariance  matrix  and  standard  errors,  were  obtained  by 
numerical  maximum  likelihood,  using  the  quasi-Newton  algorithm  [Nash,  1990]. 
The  multivariate  normal  rectangle  probabilities  for  the  multivariate  probit  model 
were  computed  using  fast  approximation  methods  [Joe,  1995];  the  first-order 
approximation  requires  only  bivariate  normal  rectangle  probabilities,  and  the 
second-order  approximation  requires  multivariate  probabilities  up  to  the  fourth 
dimension.  These  approximations  have  made  feasible  the  computations  for  the 
multivariate  probit  model;  they  are  much  more  accurate  than  older  methods,  such  as 
the  approximation  of  Mendell  and  Elston  [1974], 

Univariate  probit  regressions  were  used  to  obtain  appropriate  functions  for 
age  (e.g.,  e-age*14)  and  initial  estimates  of  regression  coefficients  for  covariates 
representing  related  features,  interactions  between  related  features,  and  gender. 
Familial  aggregation  was  assessed  among  sibs,  parent-child  pairs  (including  mother- 
child  and  father-child  pairs  separately),  and  second-degree  relatives  with  a 
multivariate  probit  model. 

Parameters  and  coefficients  with  95%  confidence  intervals  that  excluded  zero 
were  deemed  statistically  significant.  Standard  errors  and  covariance  matrices  were 
used  to  test  for  differences  between  intrafamilial  correlation  coefficients  for  different 
comparisons.  For  example,  to  test  for  a  difference  between  a  sib-sib  correlation  and 
parent-child  correlation,  we  used  the  following  formula: 

z  =  rJi-_r£S_  ^  =  J(SErJ  +  (SErpf  -  2  cov(rss,  rpc). 

Z-scores  were  converted  into  P-values  according  to  the  standard  normal  distribu¬ 
tion.  We  used  one-tailed  tests  to  compare  correlations  between  first-degree  and 
second-degree  relatives  and  between  sib  pairs  and  parent-child  pairs,  because  we  had 
a  prior  expectation  that  correlations  between  sibs  would  be  at  least  as  strong  as  those 
between  parents  and  children  [Easton  et  al.,  1993;  Szudek  et  al.,  2000b].  We  used 
two-tailed  tests  to  compare  mother-child  correlations  to  father-child  correlations. 


RESULTS 

We  studied  904  individuals  with  NF1  from  373  families  with  two  or  more 
affected  members.  Ninety-one  percent  of  the  individuals  studied  were  white,  2% 
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Asian,  1%  Black,  1%  Latin,  and  the  remainder  either  of  “other”  or  “unknown” 
ethnic  origin.  Table  I  shows  the  prevalences  of  each  of  the  10  NF1  clinical  features  in 
affected  fathers,  mothers,  and  their  affected  children  in  the  NFDB  study  sample,  and 
compares  them  to  the  prevalences  in  the  sample  used  by  Easton  et  al.  [1993]. 

Familial  aggregation  among  various  classes  of  relatives  was  estimated  using 
multivariate  regression  models.  Table  II  shows  the  regression  parameters  and 
standard  errors  for  the  terms  that  were  included  in  each  model.  The  strength  of 
association  between  the  modelled  feature  and  a  covariate  is  measured  by  /?.  A  unit 
increase  in  the  value  of  the  covariate  means  the  modelled  feature  is  exp(2/?)  times 
more  likely  to  be  present.  For  example,  subjects  with  intertriginous  freckling  were 
exp(2  x  0.52)  =  2.8  times  more  likely  also  to  have  cafe-au-lait  spots  than  subjects  of 
the  same  age  and  gender  without  intertriginous  freckling.  Also,  subjects  with 
intertriginous  freckling  and  subcutaneous  neurofibromas  were  exp(2  x  (0.52  — 
0.18 +  0.62))  =  6.8  times  more  likely  also  to  have  cafe-au-lait  spots. 

The  parameter  estimates  for  age  were  highly  significant  (P<  0.001)  for  Lisch 
nodules,  subcutaneous  neurofibromas,  cutaneous  neurofibromas,  and  intertriginous 
freckling;  significant  (P<0.05)  for  cafe-au-lait  spots,  optic  gliomas,  and  plexiform 
neurofibromas;  and  not  significant  (P>0.05)  for  other  neoplasms,  seizures,  or 
scoliosis.  The  parameter  estimate  for  gender  was  not  significant  in  any  of  the  models. 

Table  III  shows  the  number  of  sib,  parent-child  (including  mother-child  and 
father-child),  and  second-degree  relative-pairs  used  in  each  model.  Subjects  were 


TABLE  L  Number  and  Percentage  of  Subjects  From  the  NFDB  and  From  Easton  et  al  [1993]  With  Various 
NF1  Features11 


NFBD  Easton  et  al. 

[1993] 

Affected  Affected  All 

Affected  Affected  children  of  children  of  All  affected  affected 
Feature  fathers  mothers  affected  fathers  affected  mothers  relatives  relatives 


N 

% 

N 

% 

N 

% 

N 

% 

N 

% 

N 

% 

Lisch  nodules 

63 

81 

101 

80 

64 

63 

101 

51 

409 

60 

Cafe-au-lait 

67 

68 

125 

73 

115 

79 

181 

74 

657 

75 

129 

88 

macules 

Cutaneous 

78 

80 

119 

69 

37  . 

25 

67 

27 

355 

40 

121 

76 

neurofibromas 

Optic  glioma 

4 

13 

2 

4 

8 

14 

14 

15 

45 

15 

9 

5 

Subcutaneous 

60 

61 

99 

59 

31 

21 

58 

23 

291 

33 

neurofibromas 

Intertriginous 

79 

83 

147 

88 

112 

78 

191 

78 

699 

80 

freckling 

Seizures 

7 

7 

12 

7 

8 

5 

16 

6 

58 

6 

12 

7 

Plexiform 

24 

24 

35 

20 

25 

18 

44 

18 

176 

20 

37 

21 

neurofibromas 

Scoliosis 

7 

8 

8 

6 

29 

22 

32 

14 

96 

12 

27 

16 

Other  neoplasms 

4 

4 

11 

6 

4 

3 

7 

3 

33 

4 

aFeatures  not  considered  by  Easton  et  al.  [1993]  are  left  blank. 
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TABLE  II.  Continued 


Modelled  feature 

Intercept  and  covariates 

SE 

Scoliosis 

Po 

-1.11 

0.09 

Age 

-0.57 

0.34 

Male  gender 

-0.02 

0.11 

Other  neoplasms 

Po 

-0.95 

0.23 

Age 

-4.07 

2.11 

Male  gender 

-0.06 

0.21 

Lisch  nodules 

-0.55 

0.25 

Optic  glioma 

0.32 

0.31 

aThe  first  column  lists  the  10  modelled  features.  Columns  2-4  show  the  co variates  and  their  regression 
parameter  estimates  (fi )  with  standard  errors  (SE)  used  in  each  model.  p0  is  the  intercept  in  the  model 
equation.  Each  regression  accounts  for  covariates  such  as  related  features,  interactions  between  related 
features,  age,  and  gender.  Interactions  are  depicted  by  features  separated  by  an  asterisk,  and  their  values 
equal  the  product  of  the  two  interacting  features. 


TABLE  m.  Number  of  Relatives  Used  in  Multivariate  Probit  Models  for  10  Clinical  NF1  Features 


Modelled  feature 

Sibs 

Number  of  affected  pairs  included  for  feature 

Mother-child  Father-child  Second-degree  relatives 

Lisch  nodules 

192 

159 

79 

35 

Cafe-au-lait  macules 

248 

210 

129 

69 

Cutaneous  neurofibromas 

264 

224 

131 

69 

Optic  glioma 

55 

37 

26 

4 

Subcutaneous  neurofibromas 

253 

220 

131 

69 

Intertriginous  freckling 

179 

148 

75 

35 

Seizures 

268 

233 

140 

74 

Plexiform  neurofibromas 

264 

224 

131 

69 

Scoliosis 

228 

191 

131 

53 

Other  neoplasms 

47 

33 

20 

3 

included  in  a  model  only  if  the  status  (“presence”  or  “absence”)  of  the  modelled 
feature  and  all  co  variates  was  known. 

Figure  1  shows  the  adjusted  intrafamilial  correlation  coefficients  and  95% 
confidence  intervals  for  six  clinical  features  among  746  affected  first-degree  relatives 
and  among  148  affected  second-degree  relatives.  The  multivariate  probit  regression 
failed  to  converge  on  correlation  coefficients  between  second-degree  relatives 
for  optic  glioma,  other  neoplasms,  seizures,  or  scoliosis  because  of  the  low  frequency 
of  these  features  and  insufficient  sample  size.  We  did  obtain  correlation  coefficients 
between  first-degree  relatives  for  these  features,  but  none  of  these  correlations 
was  significantly  different  from  zero.  Statistically  significant  positive  correlations  be¬ 
tween  first-degree  relatives  were  found  for  5  of  the  6  other  features  listed  in  Figure  1. 
Significant  positive  correlations  between  second-degree  relatives  were  also  found  for 
2  of  these  6  features.  Significant  negative  correlations  were  not  observed  for  any  of 
the  features.  Correlations  were  significantly  greater  among  first-degree  relatives  than 
among  second-degree  relatives  for  Lisch  nodules  (P  =  0.0001)  and  cafe-au-lait  spots 
(P= 0.0004).  Correlations  among  first-degree  relatives  were  not  statistically  different 
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Fig.  1.  Adjusted  intrafamilial  correlation  coefficients  and  95%  confidence  intervals  for  six  clinical 
features  among  746  affected  first-degree  relatives  and  148  affected  second-degree  relatives.  Star  indicates  a 
significant  difference  between  correlation  coefficients  of  the  two  classes  being  compared.  The  multivariate 
probit  regression  failed  to  converge  on  correlation  coefficients  between  second-degree  relatives  for  optic 
glioma,  other  neoplasms,  seizures,  or  scoliosis  because  of  the  low  frequency  of  these  features  and 
insufficient  sample  size.  2°,  second  degree;  1°,  first  degree. 


from  correlations  among  second-degree  relatives  for  subcutaneous  neurofibromas 
(P=0.06),  cutaneous  neurofibromas  (P=0.49),  intertriginous  freckling  (P=0.07),  or 
plexiform  neurofibromas  (P=0.11). 

Figure  2  shows  the  adjusted  intrafamilial  correlation  coefficients  and  95% 
confidence  intervals  for  six  features  among  268  affected  sib  pairs  and  among  373 
affected  parent-child  pairs.  Again,  the  multivariate  probit  regression  failed  to  converge 
on  correlation  coefficients  between  sibs  or  parent-child  pairs  for  optic  glioma,  other 
neoplasms,  seizures,  or  scoliosis.  Statistically  significant  positive  correlations  between 
sibs  were  found  for  all  six  features  in  Figure  2.  Significant  positive  correlations 
between  parents  and  children  were  found  for  4  of  the  6  features.  Significant  negative 
correlations  were  not  observed  for  any  of  the  features.  Correlations  were  significantly 
greater  between  sibs  than  between  parents  and  children  for  subcutaneous 
neurofibromas  (P=0.04),  cafe-au-lait  spots  (P= 0.001),  intertriginous  freckling 
(P=0.03),  and  plexiform  neurofibromas  (P=0.02).  Correlations  between  sibs  were 
not  statistically  different  from  the  correlations  between  parents  and  children  for  Lisch 
nodules  (P=0.40)  or  cutaneous  neurofibromas  (P=0.29). 

Figure  3  shows  the  adjusted  intrafamilial  correlation  coefficients  and  95% 
confidence  intervals  for  six  features  between  233  affected  mother-child  pairs  and 
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Fig.  2.  Adjusted  intrafamilial  correlation  coefficients  and  95%  confidence  intervals  for  six  features 
among  268  affected  sib  pairs  and  373  affected  parent-child  pairs.  Star  indicates  a  significant  difference 
between  correlation  coefficients  of  the  two  classes  being  compared.  The  multivariate  probit  regression 
failed  to  converge  on  correlation  coefficients  between  sibs  or  parent-child  pairs  for  optic  glioma,  other 
neoplasms,  seizures,  or  scoliosis. 


between  140  affected  father-child  pairs.  Statistically  significant  positive  correlations 
between  mothers  and  children  were  found  for  3  of  the  6  features.  Significant  positive 
correlations  between  fathers  and  children  were  found  for  4  of  the  6  features. 
Significant  negative  correlations  were  not  observed  for  any  of  the  features  in  either 
relationship.  Correlations  between  fathers  and  children  were  significantly  greater 
than  correlations  between  mothers  and  children  for  Lisch  nodules  (P= 0.001), 
subcutaneous  neurofibromas  (P  =  0.0001),  and  cutaneous  neurofibromas  (P  =  0.02). 
Correlations  did  not  differ  significantly  between  father-child  pairs  and  mother-child 
pairs  for  cafe-au-lait  spots  (P=0.62),  intertriginous  freckling  (P  =  0.71),  or  plexiform 
neurofibromas  (P  =  0.17). 

DISCUSSION 

Limitations  of  Our  Data  and  Methods 

We  analyzed  familial  latent  correlations  for  10  NF1  clinical  features  while 
adjusting  for  other  related  features,  age,  and  gender  through  statistical  modelling. 
We  were  able  to  test  for  differences  between  correlations  among  various  classes  of 
relatives  for  6  of  the  10  features  studied.  Differences  between  various  classes  of 
relatives  were  found  for  each  of  these  six  features  (Figs.  1-3). 

The  NFDB  draws  its  information  from  specialized  clinics,  so  we  were  concerned 
about  the  representativeness  of  our  sample.  However,  frequencies  of  features  found 
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Fig.  3.  Adjusted  intrafamilial  correlation  coefficients  and  95%  confidence  intervals  for  six  features 
between  233  affected  mother-child  pairs  and  140  affected  father-child  pairs.  Star  indicates  significant 
difference  between  correlation  coefficients  of  the  two  classes  being  compared.  2°,  second  degree;  1°,  first 
degree. 


among  the  familial  cases  used  in  this  study  (Table  I)  are  comparable  to  those  seen  in 
another  family  study  of  variable  NF1  expressivity  [Easton  et  al.,  1993]  and  in  two 
available  population-based  studies  of  NF1  patients  [Huson  et  ah,  1989;  Samuelsson 
and  Axelsson,  1981]. 

Easton  et  al.  [1993]  studied  175  individuals  with  NF1  from  48  families,  including 
6  pairs  of  monozygotic  twins,  76  pairs  of  sibs,  60  parent-offspring  pairs,  54  second- 
degree  relative  pairs,  and  43  third-degree  relative  pairs.  These  investigators  examined 
eight  NF1  clinical  features  and  found  significant  intrafamilial  correlations  for 
three  quantitative  variables:  number  of  cafe-au-lait  spots,  number  of  cutaneous 
neurofibromas,  and  head  circumference.  Easton  et  al.  [1993]  also  analyzed  five  traits 
as  binary  variables,  but  these  comparisons  did  not  include  adjustment  for  age. 
Furthermore,  none  of  their  analyses  adjusted  for  the  nonindependence  of  multiple 
relative-pairs  from  the  same  family  or  of  various  clinical  features.  Our  sample  size  is 
five  times  larger,  and  we  examined  10  clinical  features,  6  of  which  are  the  same  as 
those  of  Easton  et  al.  [1993].  Also,  we  included  associations  between  features  as 
covariates  in  the  familial  analyses.  Unlike  Easton  et  al.  [1993],  we  did  not  have 
counts  of  cafe-au-lait  spots  and  dermal  discrete  neurofibromas,  but  the  quantitative 
investigations  of  these  features  by  Easton  et  al.  [1993]  complement  our  binary 
analyses  nicely.  Both  studies  found  evidence  of  modifying  genes  on  cafe-au-lait  spots 
but  not  on  dermal  discrete  neurofibromas. 

All  of  the  features  we  studied  were  treated  as  binary  variables.  Many  of  the 
clinical  features  of  NF1  (and  other  diseases)  are  by  nature  binary,  and  ours  is  the  first 
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study  to  examine  correlations  for  binary  traits  among  different  familial  relationships 
while  accounting  for  continuous  covariates  such  as  age.  Similar  methods  were  used 
to  study  lens  opacities  [Anonymous,  1994]  and  liver  cancer  [Liang  and  Beaty,  1991] 
in  individuals  who  did  not  have  NF1,  but  we  may  be  the  first  to  study  an  autosomal- 
dominant  disease  in  this  manner. 

Although  this  is  by  far  the  largest  group  of  NF1  families  ever  studied,  we  only 
had  74  pairs  of  second-degree  relatives.  Models  for  most  features  used  even  fewer 
second-degree  relatives  because  the  data  were  incomplete.  Subjects  were  included  in 
a  model  only  if  the  status  of  the  modelled  feature  and  of  all  covariates  was  known 
(Table  III).  These  relatively  small  sample  sizes  are  reflected  in  the  wide  95% 
confidence  intervals  for  the  correlation  coefficients  among  second-degree  relatives 
(Fig.  1).  Statistical  techniques  are  less  reliable  for  smaller  sample  sizes,  so  we  must 
attach  an  additional  note  of  caution  to  the  point  estimates  for  the  correlation 
coefficients  between  second-degree  relatives,  particularly  for  Lisch  nodules  and 
intertriginous  freckling,  in  which  the  analysis  included  only  35  pairs  of  second-degree 
relatives  (Table  III). 

Several  features  had  significantly  positive  correlations  among  second-degree 
relatives,  but  none  of  these  correlations  was  significantly  greater  than  that  for  the 
same  feature  among  first-degree  relatives  (Fig.  1).  Similarly,  several  features  had 
significantly  positive  correlations  between  parents  and  children,  but  none  of  the 
correlations  was  greater  than  that  for  the  same  feature  between  sibs  (Fig.  2).  The 
absence  of  significant  negative  correlations  supports  the  statistical  validity  of  our 
approach.  One  would  expect  to  observe  negative,  as  well  as  positive,  correlations  by 
chance  when  making  multiple  comparisons. 

The  most  important  confounding  factor  in  familial  analyses  of  NF1  is  age. 
Many  disease  features  are  more  prevalent  in  older  NF1  patients  [Cnossen  et  al., 
1998],  and  if  not  appropriately  controlled,  age  might  produce  a  correlation  between 
affected  relatives  of  similar  age  (e.g.,  sibs)  or  obscure  a  correlation  between  relatives 
of  very  different  ages  (e.g.,  parents  and  children).  Our  multivariate  models  minimize 
the  confounding  effect  of  age,  but  they  may  not  eliminate  it  completely.  The 
covariate  representing  age  was  significant  in  models  for  most  features,  but  it  is 
possible  that  a  residual  age  effect  is  contributing  to  the  observed  differences  between 
sib-sib  and  parent-child  pairs  for  features  such  as  subcutaneous  neurofibromas  and 
intertriginous  freckling  that  become  more  prevalent  with  age  (Fig.  2).  Age  is  less 
likely  to  influence  the  intrafamilial  correlations  for  cafe-au-lait  spots  or  plexiform 
neurofibromas  which,  when  considered  as  discrete  variables,  occur  with  a  relatively 
stable  frequency  with  age  [DeBella  et  al.,  2000;  Friedman  et  al.,  1999]. 

Patterns  of  Associations  of  Clinical  Features  Among 
Relatives 

Lisch  nodules  and  cafe-au-lait  spots  had  significantly  higher  correlations  among 
first-degree  relatives  than  among  second-degree  relatives.  Higher  correlations  for 
first-  than  second-degree  relatives  would  be  expected  for  effects  produced  by 
modifying  genes  at  unlinked  loci,  but  might  also  result  from  environmental  factors 
that  are  more  likely  to  be  shared  among  closer  relatives.  Our  observations  are 
consistent  with  the  effect  of  a  modifying  gene  on  the  pathogenesis  of  Lisch  nodules. 
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Easton  et  al.  [1993]  found  a  higher  correlation  for  cafe-au-lait  spots  between 
monozygotic  twins  than  between  sibs,  suggesting  the  effect  of  a  genetic  locus  or  loci 
in  addition  to  NF1.  Our  findings  of  a  strong  correlation  for  cafe-au-lait  spots  in  first- 
degree  relatives  but  no  correlation  among  second-degree  relatives  are  consistent  with 
this  interpretation. 

Lisch  nodules  and  cafe-au-lait  spots  share  an  origin  from  neural  crest-derived 
tissue,  but  this  is  also  true  of  some  other  lesions  characteristic  of  NF1,  including 
neurofibromas  of  all  types  and  intertriginous  freckling  [Bolande,  1981],  We 
previously  reported  an  association  between  the  occurrence  of  Lisch  nodules  and 
cafe-au-lait  spots  in  individual  NF1  patients  [Szudek  et  al.,  2000b],  but  intertriginous 
freckling  was  also  associated,  i.e.,  a  feature  that  shows  no  indication  of  a  stronger 
familial  correlation  among  first-degree  than  second-degree  relatives  (Fig.  1). 

Intertriginous  freckling,  subcutaneous  neurofibromas,  plexiform  neurofibro¬ 
mas,  and  cafe-au-lait  spots  had  higher  correlations  between  sibs  than  between 
parents  and  children.  Easton  et  al.  [1993]  found  that  concordance  for  dermal  discrete 
neurofibromas  (which  include  subcutaneous  neurofibromas)  between  monozygotic 
twins  was  much  higher  than  between  sibs,  an  observation  that  suggests  the 
involvement  of  a  genetic  factor.  Affected  sibs  would  be  expected  to  share  the  same 
normal  NF1  allele  by  descent  half  of  the  time,  but  parent-child  pairs  rarely  would. 
Effects  of  functional  polymorphisms  of  the  normal  NF1  allele  might  explain  a  higher 
correlation  of  these  features  among  sib  pairs  than  among  parent-child  pairs.  Another 
possible  explanation  is  that  differences  in  environmental  factors  are  more  likely  to  be 
shared  among  sibs  than  between  a  parent  and  child. 

Intertriginous  freckling,  subcutaneous  neurofibromas,  plexiform  neurofibro¬ 
mas,  and  cafe-au-lait  spots  all  share  an  origin  from  neural  crest-derived  cells.  We 
found  that  cafe-au-lait  spots  and  intertriginous  freckling  tended  to  occur  together  in 
individual  NF1  patients,  and  so  did  cutaneous,  subcutaneous,  and  plexiform 
neurofibromas,  but  associations  were  not  seen  between  the  features  in  these  two 
groups  [Szudek  et  al.,  unpublished  findings].  In  the  present  study,  we  did  not  find  a 
stronger  correlation  for  cutaneous  neurofibromas  in  sibs  than  in  parent-child  pairs, 
as  we  did  for  subcutaneous  and  plexiform  neurofibromas  (Fig.  2). 

Lisch  nodules,  subcutaneous  neurofibromas,  and  cutaneous  neurofibromas  had 
higher  correlations  between  affected  fathers  and  children  than  between  affected 
mothers  and  children  (Fig.  3).  Our  sample  included  twice  as  many  mother-child  pairs 
as  father-child  pairs,  so  we  were  concerned  about  ascertainment  bias,  i.e.,  the 
possibility  that  only  severely  affected  father-child  pairs  tended  to  be  seen  in  the  NF 
clinics  that  contributed  data  to  the  NNFF  International  Database.  However,  the 
frequencies  of  all  features  studied  were  similar  in  affected  fathers  and  affected 
mothers  (Table  I). 

Shared  environment  is  unlikely  to  be  the  sole  cause  of  associations  between 
parents  and  children,  due  to  large  differences  in  age.  It  is  also  unlikely  that  shared 
environment  is  responsible  for  the  difference  in  correlations  between  mother-child 
and  father-child  pairs.  Likewise,  a  multifactorial  influence  with  a  more  extreme 
threshold  for  males  than  for  females  cannot  explain  the  observations  for  these 
features.  Gender  is  not  a  significant  predictive  factor  in  any  of  our  models  (Table  II), 
and  feature  frequencies  among  affected  children  of  affected  fathers  are  similar  to 
those  among  affected  children  of  affected  mothers  (Table  I).  Parent-of-origin  effects 
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on  severity  of  NF1  have  been  suggested  [Hall,  1981;  Miller  and  Hall,  1978],  but  most 
studies  do  not  support  this  possibility  [Huson  et  al.,  1989;  Riccardi  and  Wald,  1987]. 
Our  findings  are  consistent  with  a  parent-of-origin  effect  on  the  strength  of  the 
parent-child  correlation  rather  than  with  a  more  severe  phenotype  in  affected 
offspring  of  parents  of  one  gender  when  compared  to  affected  offspring  of  parents  of 
the  other  gender.  Similar  parent-child  aggregation  patterns  were  reported  for  body 
mass  index  [Friedlander  et  al.,  1988]  and  blood  pressure  [Hurwich  et  al.,  1982],  but 
they  are  unprecedented  in  NF1.  We  do  not  know  of  a  genetic  mechanism  that  can 
explain  this  phenomenon. 

CONCLUSIONS 

The  patterns  of  familial  correlations  shown  here  suggest  that  genetic  factors 
involved  in  determining  the  occurrence  of  various  clinical  features  of  NF1  vary, 
depending  on  the  feature.  In  some  instances,  the  effects  of  unlinked  modifying  genes 
may  be  most  important.  In  other  instances,  the  effects  of  the  normal  NF1  allele  may 
predominate.  More  than  one  genetic  factor  may  be  involved,  and  the  relative 
importance  of  various  genetic  and  nongenetic  effects  may  vary  for  different  features. 

Some  of  the  clinical  variability  that  characterizes  NF1  may  result  from  allelic 
heterogeneity  of  the  constitutional  NF1  mutation.  Many  NF1  patients  have  been 
genotyped,  but  little  evidence  of  allele-phenotype  correlation  has  been  observed 
[Rasmussen  and  Friedman,  2000].  This  may  be  because  phenotypic  differences 
resulting  from  NF1  allelic  heterogeneity  are  generally  small  in  comparison  to  other 
sources  of  variability.  It  is  also  possible,  however,  that  important  NF1  genotype- 
phenotype  correlations  exist  but  have  not  been  recognized  because  of  the  complexity 
of  the  NF1  phenotype  [Riccardi,  1999],  its  strong  dependence  on  age  [DeBella  et  al., 
2000],  the  nonindependence  of  many  clinical  features  [Szudek  et  al.,  2000b;  Szudek 
et  al.,  unpublished  findings],  and  the  heterogeneity  of  pathogenic  NF1  mutations 
[Fahsold  et  al.,  2000;  Korf,  1999;  Messiaen  et  al.,  2000], 

Our  findings  suggest  that  most  NF1  clinical  features  have  important  genetic 
components.  The  patterns  of  variable  expressivity  are  subtle,  so  data  will  be  required 
on  a  very  large  number  of  patients  and/or  on  very  large  families  to  identify 
modifying  genes  that  affect  the  NF1  phenotype.  Objective  quantitative  variables 
such  as  lesion  counts  would  enable  a  more  detailed  analysis  of  familial  segregation 
patterns  and  would  require  fewer  patients  than  binary  variables  of  the  type  used  in 
the  analyses  reported  here.  Given  the  progressive  nature  of  many  NF1  disease 
features  and  the  potentially  confounding  effects  of  age  on  analysis,  it  is  essential  that 
the  data  be  representative  of  all  age  groups.  A  very  dense  map  of  single-nucleotide 
polymorphisms  (SNPs)  is  now  available  in  humans  [Sachidanandam  et  al.,  2001],  so 
a  random  genome  scan  for  NF1  modifying  loci  is  theoretically  possible.  Improved 
understanding  of  neurofibromin’s  biochemical  functions  may  permit  the  discovery  of 
interacting  proteins  and  of  upstream  and  downstream  effectors  that  are  critical  to  the 
development  of  particular  phenotypic  features.  This  would  greatly  facilitate  the 
identification  of  modifying  loci. 

The  multivariate  probit  regression  methods  used  in  this  study  to  estimate 
familial  aggregation  of  NF1  clinical  features,  while  adjusting  for  age,  gender,  and  the 
presence  of  other  clinical  features,  are  likely  to  be  useful  for  analysis  of  other  genetic 
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diseases.  Application  of  these  methods  to  Mendelian  conditions  that  have  highly 
variable  and  age-dependent  phenotypes,  such  as  tuberous  sclerosis  complex  [Cheadle 
et  ah,  2000],  Gorlin  syndrome  [Wicking  and  Bale,  1997],  and  Stickler  syndrome 
[Snead  and  Yates,  1999],  seems  especially  promising. 
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Neurofibromatosis  2  (NF2)  is  a  genetic  disease  that  occurs  in  approximately  1  in  40,000  live  births.  Almost  all  affected 
individuals  develop  bilateral  tumors  of  Schwann  cells  that  surround  the  vestibular  nerves;  these  tumors  are  known  as 
vestibular  schwannomas  (VS).  Evidence  from  molecular  genetic  studies  suggests  that  at  least  two  mutations  are  involved  in 
formation  of  VS  in  patients  with  NF2.  Several  authors  proposed  probabilistic  models  for  this  process  in  other  tumors,  and 
showed  that  such  models  are  consistent  with  incidence  data.  We  evaluated  two  different  probabilistic  models  for  a  "2-hit" 
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INTRODUCTION 

Probabilistic  models  for  tumorigenesis  have 
been  used  extensively  in  genetic  epidemiology  to 
generate  and  test  hypotheses  about  the  genetic 
mechanisms  that  are  responsible  for  tumor  devel¬ 
opment  [Armitage  and  Doll,  1954;  Hethcote  and 
Knudson,  1978;  Knudson,  1971;  Moolgavkar  and 
Knudson,  1981;  Moolgavkar  and  Luebeck,  1992]. 
The  common  theme  incorporated  into  most  of 
these  models  is  that  a  tumor  cell  is  assumed  to  be 
the  outcome  of  a  sequence  of  irreversible  events 
(mutations).  These  events  progressively  transform 
normal  tissue  cells  into  tumor  cells.  Chu  [1987] 
provided  a  clear,  nonmathematical  introduction  to 
these  models. 

Knudson  [1971]  and  Knudson  et  al.  [1975] 
proposed  a  two-stage  model  for  cancer  initiation 
to  describe  the  incidence  of  both  sporadic  and 
hereditary  retinoblastomas.  According  to  this 


model,  a  tissue  cell  is  transformed  into  a  tumor 
cell  after  sustaining  two  irreversible  mutations. 
This  is  often  referred  to  as  a  "2-hit”  model,  where 
the  term  "hit"  refers  to  mutations  in  a  cell.  The 
first  of  these  mutations  is  assumed  to  occur  in  one 
of  two  ways:  in  hereditary  cases,  individuals 
inherit  the  first  mutation;  in  sporadic  cases,  the 
first  mutation  occurs  by  chance  in  a  somatic-cell 
progenitor  of  the  tumor.  The  second  mutation  is 
assumed  to  occur  by  chance  in  a  tumor-cell 
progenitor  in  both  hereditary  and  sporadic 
cases. 

Subsequent  molecular  genetic  studies  demon¬ 
strated  that  this  model  was  correct  in  that  both 
alleles  of  the  RBI  locus  are  almost  always  mutated 
in  retinoblastoma  [Knudson,  1996].  Moreover,  one 
mutant  RBI  allele  is  inherited  and  the  second 
allele  is  lost  somatically  in  tumors  in  patients  with 
hereditary  retinoblastoma,  whereas  both  alleles 
are  lost  somatically  in  sporadic  retinoblastomas. 
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as  predicted  [Knudson,  1996].  Many  other  heredi¬ 
tary  tumors,  including  those  associated  with  Li- 
Fraumeni  syndrome  [Evans  and  Lozano,  1997], 
hereditary  breast  and  ovarian  cancer  syndrome 
[Hofmann  and  Schlag,  2000],  hereditary  adeno¬ 
matous  polyposis  [Aaltonen,  2000],  and  neurofi¬ 
bromatosis  1  [Parada,  2000],  have  been  shown  to 
involve  a  similar  "2-hit"  mechanism.  Functional  or 
actual  loss  of  both  alleles  of  these  and  other  tumor 
supressor  genes  is  now  known  to  be  an  important 
pathogenic  mechanism  in  most  neoplasms 
[Fearon,  2001]. 

However,  it  is  clear  that  the  pathogenesis  of 
many  tumors  involves  more  than  just  two  "hits." 
Tumors  often  exhibit  mutations  or  functional 
alteration  of  several  different  genetic  loci,  and 
none  of  these  changes  may  be  sufficient  by  itself  to 
produce  a  malignant  phenotype  [Strachan  and 
Read,  1999].  It  is,  therefore,  of  some  interest  to 
consider  more  complex  epidemiological  models 
than  that  originally  formulated  by  Knudson  [1996]. 
Moolgavkar  and  Venzon  [1979]  introduced  an 
alternate  mathematical  formulation  of  the  2-hit 
model  that  incorporated  both  the  growth  and  death 
of  cells  in  the  tissue  at  risk;  this  model  is  depicted 
in  Fig.  1.  This  formulation  is  also  convenient  to 
extend  to  a  3-hit  model;  such  a  model  (Fig.  2) 
would  require  that  a  tissue  cell  sustain  three 
mutations  to  develop  into  a  tumor  cell. 

Neurofibromatosis  2  (NF2)  is  a  dominantly 
inherited  tumor  predisposition  syndrome  [Mac- 
Collin,  1999;  MacCollin  and  Stemmer-Rachami- 
nov,  1999].  Most  affected  patients  develop 
bilateral  vestibular  schwannomas  (VS),  and  many 


develop  schwannomas  of  other  nerves  and  me¬ 
ningiomas.  Mutations  of  both  alleles  of  the  NF2 
locus  have  been  demonstrated  in  tumors  from 
patients  with  NF2  as  well  as  in  sporadic  VS  in 
patients  who  do  not  have  NF2  [Evans  et  al.,  2000]. 
In  accordance  with  the  hypothesis  of  Knudson 
[1996],  a  germline  NF2  mutation  can  be  demon¬ 
strated  in  most,  and  is  presumed  to  exist  in  all, 
patients  with  NF2  [Evans  et  al.,  2000]. 

The  purpose  of  this  study  is  to  fit  a  selection  of 
multistage  models  for  tumor-cell  development  to 
VS  data  from  patients  with  NF2.  VS  are  tumors  of 
Schwann  cells  that  surround  the  vestibular  nerves; 


Fig.  1.  Two-hit  model  for  development  of  vestibular  schwanno¬ 
mas  in  NF2  patients.  All  cells  in  the  body  have  at  least  one 
mutation,  inherited  through  the  germline.  Circles  denote  cells  in 
a  given  stage  of  the  model,  and  arrows  denote  possible 
transitions  between  stages  of  the  model,  a,  p,  and  ft  represent 
cell  growth,  death,  and  mutation  rates,  respectively. 


Fig.  2.  Three-hit  model  for  development  of  vestibular  schwannomas  in  NF2  patients.  All  cells  in  the  body  have  at  least  one  mutation, 
inherited  through  the  germline.  Circles  denote  cells  in  a  given  stage  of  the  model,  and  arrows  denote  possible  transitions  between 
stages  of  the  model  a,  p,  and  ft  represent  cell  growth,  death,  and  mutation  rates,  respectively. 
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hence  the  tissue  at  risk  for  our  study  is  the  pool  of 
Schwann  cells  that  surround  both  vestibular 
nerves.  We  will  examine  the  fit  of  two  different 
2-hit  models  and  a  3-hit  model  to  our  patient 
data. 


MODELS  AND  METHODS 

DESCRIPTION  OF  DATA 

The  patient  data  used  to  fit  the  models  were 
obtained  from  the  United  Kingdom  NF2  Registry, 
which  contains  clinical  and  genetic  information  on 
a  large  number  of  NF2  patients  ascertained 
through  medical  specialists  throughout  the  United 
Kingdom.  Information  available  for  each  patient 
included  age  at  onset  of  first  VS,  whether  the  VS 
was  unilateral  or  bilateral,  and  family  history.  The 
database  contains  163  NF2  probands  who  had 
sufficient  information  for  inclusion  in  this  study. 
For  the  analyses  that  follow,  we  assume  that  this 
sample  is  representative  of  all  NF2  patients.  The 
tumor  of  interest  is  the  VS,  and  the  dependent 
variable  that  we  model  is  the  age  at  occurrence  of 
the  first  tumor  cell.  We  assume  that  the  time 
between  development  of  the  first  tumor  cell  and 
the  age  at  which  the  first  VS  is  detected  is  roughly 
constant  and  therefore  does  not  affect  model 
comparisons.  Although  all  patients  included  in 
our  analysis  have  bilateral  VS,  we  model  only  the 
time  to  diagnosis  of  the  first  VS  in  these  patients. 

TWO-HIT  MODEL  OF  MOOLGAVKAR  AND 
VENZON  [1979] 

The  two-mutation  model  for  hereditary  tumors 
presented  by  Moolgavkar  and  Venzon  [1979]  is 
appropriate  for  NF2  patients  who  have  one 
mutant  copy  of  the  NF2  gene  present  in  all  cells 
at  birth.  According  to  this  model,  a  Schwann  cell 
in  an  NF2  patient  becomes  a  tumor  cell  when  a 
second  mutation  occurs,  inactivating  the  normal 
NF2  gene.  This  model  assumes  that  in  a  small 
interval  of  time.  At,  a  Schwann  cell  divides  into 
two  Schwann  cells  with  probability  aAt+o(At);  dies 
with  probability  pAt+o(At );  or  mutates  to  form  a 
tumor  cell  with  probability  pAt+o(At).  The  prob¬ 
ability  that  more  than  one  event  occurs  in  this 
interval  of  time  is  o(Af)-  We  will  use  the  notation  0 
to  denote  the  vector  of  model  parameters  (a,  P,  p). 
Additionally,  it  is  assumed  that  cells  behave 
independently  of  one  another  and  that  mutations 
occur  during  cell  division.  We  also  assume  that 
Schwann  cells  with  one  mutation  are  phenotypi¬ 


cally  identical  and  have  an  identical  rate  of 
cellular  proliferation  as  normal  Schwann  cells. 

We  will  refer  to  two  fundamental  statistical 
quantities  throughout  this  paper,  namely  the 
hazard  function  and  the  probability  distribution 
function.  Let  h(t\0)  denote  the  hazard  function  for 
the  random  variable  T;  here  6  is  used  to  denote  the 
vector  of  parameters  from  a  parametric  model  for 
T.  This  function  represents  the  instantaneous  risk 
of  tumor  at  time  t  in  a  previously  tumor-free  tissue 
and  is  defined  in  the  following  fashion: 

hm  (1) 
v  1  '  At->0  At  v  ' 

The  probability  distribution  function  for  the 
random  variable  T,  denoted  by  F(t\0),  represents 
the  probability  that  an  individual  will  develop  a 
tumor  prior  to  or  at  time  t.  We  can  express  F(t\0)  in 
terms  of  the  hazard  function  given  above: 

F(t\0)  =  1  —  exp|  —  J  h(u\0)du^.  (2) 

It  is  worth  noting  that  the  probability  density 
function  for  T  is  given  by  f(t\0)  =  F'(t\0)  = 

mmi-Fm}. 

For  this  model,  Moolgavkar  and  Venzon  [1979] 
showed  that  the  hazard  function  for  the  random 
variable  T,  representing  the  age  at  occurrence  of 
the  somatic  mutation  (i.e.,  of  the  first  tumor  cell), 
is  given  by: 

h(t\0)  =  ~a<f>{l,  0,  t)  +  (a  +  P  +  p) 

-pm,  o,  or1, 


where 


^  n  j.\  Cl  “  Cz  NteXPWCl  -  m] 

<p(l,  0,  0  = - r4 - t-T 

1  -  y^expWCi  -  C2)f} 


(3) 


and 


c,  =2 -(z+p+f) 


-  Va2  -  ^aP  +  +  /?2  +  2/fyi  +  r, 


2a 

C2  =^(a  +  P  +  F) 


(4) 


+  ^  \/a2  -  lap  +  lap  +  p2  +  2Pp  +  p2, 


Note  that  h(t\6)  is  the  hazard  function  for  T 
assuming  a  single  initial  tissue  cell.  If  we  assume 
that  the  tissue  contains  N  cells  initially,  then  the 
hazard  function  for  the  entire  tissue,  h^(t\0),  is 
simply  Nh(t\0);  this  follows  directly  from  the 
assumption  that  the  cells  behave  independently 
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of  one  another.  A  likelihood  for  the  data  can  be 
constructed  in  the  customary  fashion  [Lawless, 
1982],  with  patient  i,  having  tumor  onset  time  tu 
contributing  hN(h\0){{l  -  Fn(U\6)}}  to  the  like¬ 
lihood.  The  likelihood,  as  a  function  of  6,  can  be 
maximized  by  means  of  a  quasi-Newton  algo¬ 
rithm  [Nash,  1990]  to  obtain  maximum  likelihood 
estimates  (MLE)  of  the  model  parameters. 

NONHOMOGENEOUS  POISSON  PROCESS 

Another  approach  to  modeling  this  two-muta- 
tion  process  is  to  make  some  additional  assump¬ 
tions  regarding  the  division  and  death  of  tissue 
cells.  If  the  tissue  were  assumed  to  grow  accord¬ 
ing  to  a  deterministic  process,  then  the  number  of 
cells  present  in  the  tissue  at  any  time  t  can  be 
given  by  a  function  X(f).  If  a  reasonable  functional 
form  for  X(f)  could  be  chosen  and  it  is  assumed 
that  there  is  a  small  chance  that  any  of  the  tissue 
cells  mutate,  then  the  generation  of  tumor  cells 
can  be  modeled  according  to  a  nonhomogeneous 
Poisson  process.  If  we  assume  that  the  rate  at 
which  tissue  cells  mutate  is  a  constant  fi,  then  the 
intensity  of  the  process  is  simply  h(t\n)  =  nX(t), 
which  is  of  course  the  hazard  function  for  the  age 
at  occurrence  of  the  somatic  mutation.  We  will 
refer  to  this  model  as  the  "Poisson"  model. 

A  likelihood  for  the  data  is  easily  constructed 
for  this  model,  and  the  MLE  for  the  mutation  rate 
has  a  simple,  closed  form  solution.  Again,  the 
contribution  of  patient  i  to  the  likelihood  function 
is  =  h(ti\fi){l  —  F(ti\fi)},  and  the  MLE  for  the 
mutation  rate,  estimated  from  a  sample  of  n 
patients,  is  given  by: 


Eli  (/o  X(s)ds) 

It  is  important  to  note  that  the  choice  of  the 
function  X(t)  clearly  influences  the  estimate  of  the 
communication  rate.  The  integral  in  the  expres¬ 
sion  for  the  estimate  of  the  mutation  rate  may 
require  numerical  integration  if  the  function 
chosen  for  X(t)  is  not  conveniently  integrable. 

THREE-HIT  MODEL 

Thus  far  we  have  assumed  that  tumorigenesis  is 
a  two-mutation  process,  and  that  a  tumor  cell  is 
generated  when  a  normal  Schwann  cell  sustains 
two  irreversible  mutations.  Here  we  explore  a 
model  that  assumes  that  a  tumor  cell  is  the  end 
result  of  a  normal  tissue  cell  sustaining  three 
irreversible  mutations;  this  model  will  be  referred 
to  as  the  "3-hit"  model.  In  NF2  patients,  the  first 


of  these  mutations  has  already  been  sustained 
prior  to  birth,  and  the  two  subsequent  mutations 
are  assumed  to  occur  by  chance  in  the  somatic 
tissue.  We  assume  for  simplicity  that  these  two 
mutations  occur  at  a  constant,  common  mutation 
rate  /x.  We  also  assume  that  Schwann  cells  with 
one  mutation  or  with  two  mutations  have  iden¬ 
tical  rates  of  cellular  proliferation  as  normal 
Schwann  cells.  Moolgavkar  and  Luebeck  [1990] 
presented  a  model  that  is  appropriate  for  this 
process,  and  Heidenreich  et  al.  [1997]  discussed 
parameter  identifiability  issues  with  the  model.  It 
is  possible  to  estimate  a  reduced  number  of  model 
parameters  without  assuming  equal  mutation 
rates  in  the  two  stages  of  the  model,  but  a 
different  parameterization  would  require  addi¬ 
tional  assumptions  and  may  yield  estimates  that 
are  difficult  to  interpret. 

Again  we  denote  the  number  of  Schwann  cells 
in  our  tissue  of  interest  by  X(f);  recall  that  these 
cells  already  carry  a  single  mutation.  As  in  the 
case  of  the  Poisson  model,  we  must  specify  a 
functional  form  for  X(f)  to  fit  our  model  to  patient 
data.  We  will  refer  to  cells  that  have  sustained  a 
second  but  not  a  third  mutation  as  intermediate 
cells,  and  cells  that  have  sustained  three  mutations 
as  tumor  cells.  In  a  small  interval  of  time  At,  the 
probability  that  an  intermediate  cell  is  generated 
by  the  mutation  of  a  tissue  cell  is  /xX(f)Af+o(Af). 
The  probability  that  more  than  one  intermediate 
cell  is  generated  in  this  fashion  is  o{At).  The 
growth,  death,  and  mutation  of  the  intermediate 
cells  are  assumed  to  follow  a  birth-death  process. 
Again,  in  a  small  interval  of  time  At,  an 
intermediate  cell  divides  into  two  intermediate 
cells  with  probability  aAt  +  o(At),  dies  with  prob¬ 
ability  0At  +  o(At),  and  divides  into  an  intermedi¬ 
ate  cell  and  a  tumor  cell  with  probability 
fiAt  +  o(At).  The  probability  of  more  than  one 
such  event  occurring  in  this  time  interval  is  o(At). 
Moolgavkar  and  Luebeck  [1990]  showed  that  for 
such  a  model,  the  hazard  function  is  given  by: 

h(t\0)  =  i?  f  X(s)  exp {g(0;  t  -  s)}ds,  (6) 
J  0 

where 

g(6;  t  —  s)  — 

2<xC,(f-s)+21ogi - - 1— E5a - 1  (7) 

g\l-l^exp{a(C1-C2)(t-s)}J  U 

-  (a  +  0  + /u)(t  -  s), 
and  Ci  and  C2  are  defined  as  above. 
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TABLE  I.  Summary  of  assumptions  used  to  estimate  number  of  vestibular  nerve  Schwann  cells  for  Poisson  and  3-hit 
models,,  and  references  that  support  assumptions  chosen 


Factor 

Range 

Midvalue 

References 

Number  of  vestibular  nerves 

2 

2 

Number  of  axons  in  each  vestibular  nerve 

14,000-22,000 

18,500 

Rasmussen,  1940;  Natural  and  Schuknecht,  1972; 
Bergstrom,  1973;  Richter,  1980 

Length 

5-7  mm 

6  mm 

F.H.  Linthicum,  Jr.,  personal  communication 

Intemode  distance  (distance  covered  by 
one  Schwann  cell) 

0.5-1 .0  mm 

0.75  mm 

Natout  et  al.,  1987;  Schalow,  1989;  Vizoso,  1950 

A  likelihood  for  the  data  as  a  function  of  6  can 
be  constructed  in  a  similar  fashion  to  the  previous 
models,  and  estimates  for  the  model  parameters 
can  be  found  by  numerically  maximizing  the 
likelihood.  The  integral  in  Equation  (6)  can  be 
calculated  numerically 

RESULTS 

To  enable  the  fitting  of  both  the  Poisson  and  3-hit 
models  to  patient  data,  we  must  include  an  estimate 
of  the  number  of  Schwann  cells  in  the  tissue  as  a 
function  of  age.  We  assume  that  this  function  is 
constant  in  the  fitting  of  both  of  these  models;  thus 
for  any  time  t,  we  assume  X(f)=300,000.  This 
number  is  calculated  from  the  estimates  summar¬ 
ized  in  Table  I,  as  twice  the  product  of  the  number 
of  axons  found  in  each  vestibular  nerve  and  the 
length  of  a  nerve  axon,  divided  by  the  intemode 
distance.  Using  the  midvalues  in  Table  I,  we  get 
2  x  18,500  x  6.0/0.75—300,000  as  the  total  number 
of  Schwann  cells  in  the  pool  that  gives  rise  to  a 
vestibular  schwannoma. 

We  assume  that  this  number  of  cells  remains 
roughly  constant  [Vizoso,  1950]  from  about  24 
weeks  postconception,  when  it  is  first  achieved, 
through  adulthood.  The  mutations  that  give  rise  to 
a  tumor  must  therefore  occur  during  tissue 
maintenance,  as  rare  cell  divisions  are  triggered 
to  replace  tissue  cells  that  die  from  time  to  time. 

These  assumptions  mean  that  the  value  N  for 
the  2-hit  model  will  be  300,000,  and  that  the 
maximum  value  the  function  X(f)  can  take  for  the 
Poisson  and  3-hit  models  is  also  300,000.  We  will 
comment  further  on  these  assumptions  in  the 
Discussion. 

Any  integration  required  in  the  likelihood  of  the 
3-hit  model  has  been  performed  numerically  with 
Romberg  integration  [Mathews,  1992].  Note  that 
the  integral  in  the  expression  for  the  Poisson 
likelihood  is  simplified  in  the  case  where  X(t)  is 
constant  and  can  be  done  analytically,  and  results 
in  an  exponential  distribution  for  T. 


TABLE  II.  Summary  of  parameter  estimates  from  model 
fitting 


Model:  2-hit  model 

Value  of  log-likelihood:  -669.0 
Akaike  information  criterion:  -672.0 
Parameter  Estimate 

SE 

a 

3.73  x  lO"1 

1.39  x  lfT1 

p 

3.27  x  10"1 

1.39  xlO'1 

3.76  x  1CT8 

1.00  xl(T8 

Model:  Poisson  model 

Value  of  log-likelihood:  — 717.0 

Akaike  information  criterion:  -718.0 

Parameter 

Estimate 

SE 

M- 

l.ii  x  icr7 

0.09  x  10"7 

Model;  3-hit  model 

Value  of  log-likelihood:  -651.3 

Akaike  information  criterion:  -654.3 

Parameter 

Estimate 

SE 

a 

8.26  x  10"’ 

0.71  x  10"1 

p 

8.21  x  lO-1 

0.71  x  10"1 

7.15  x  10"5 

0.59  x  10"5 

The  MLEs  for  the  2-hit  model  parameters  are 
presented  with  their  estimated  standard  errors  in 
Table  II;  the  value  of  the  log-likelihood  evaluated 
at  the  MLE  is  also  provided.  The  estimated  model 
parameters  can  be  input  into  the  distribution 
function  F^(t\0)  and  plotted  by  age  to  provide  a 
comparison  between  the  distribution  produced  by 
the  model  and  the  empirical  distribution  for  the 
data.  The  estimated  and  empirical  distributions 
are  plotted  in  Fig.  3.  The  overall  model  fit  is 
reasonable;  however,  the  model-estimated  inci¬ 
dence  of  tumor  is  much  higher  than  the  observed 
incidence  for  ages  less  than  20  years. 

The  estimate  of  the  mutation  rate  from  the 
Poisson  model,  its  estimated  standard  error,  and 
the  value  of  the  log-likelihood  evaluated  at 
the  estimate  are  presented  in  Table  II.  The 
Poisson  model-estimated  distribution  function 
for  age  at  onset  of  the  first  tumor  cell  is  plotted 
in  Fig.  3  to  allow  for  comparison  with  the 
empirical  distribution  function  and  the  2-hit 
model.  The  fit  of  the  Poisson  model  to  the  data 
is  quite  poor. 
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Fig.  3.  Model-fitted  distribution  functions  from  three  models  and  empirical  distribution  function  for  data.  Cl,  confidence  interval. 


A  similar  display  of  model  parameter  estimates 
is  provided  in  Table  II  for  the  3-hit  model.  The 
model-estimated  distribution  function  for  the  3-hit 
model  is  also  included  in  Fig.  3.  The  model- 
estimated  distribution  function  fits  the  data  fairly 
closely  for  the  3-hit  model,  except  for  ages  less 
than  18. 

An  additional  method  of  comparing  the  fit  of 
the  models  to  the  data  is  to  use  the  Akaike 
information  criterion  (AIC)  [Sakamoto  et  al.,  1986], 
which  adds  a  penalty  term  for  the  number  of 
model  parameters  to  the  log-likelihood.  For  our 
summaries,  we  define  AIC  to  be  log(f)  -  p,  where 
£  is  the  likelihood  for  the  data  and  p  is  the  number 
of  model  parameters.  The  ordering  of  the  three 
models  according  to  this  criterion  is  consistent 
with  the  quality  of  model  fit  observed  in  Fig.  3: 
the  3-hit  model  provides  the  best  fit  to  the  data, 
the  2-hit  model  provides  the  next  best  fit,  and  the 
Poisson  model  provides  the  poorest  fit. 


DISCUSSION 

The  models  we  have  described  all  make  a 
common  assumption  regarding  the  process  of 


mutation  in  the  tissue:  that  mutations  can  only 
occur  during  cell  divisions  involved  in  the  process 
of  tissue  maintenance  once  the  tissue  has  reached 
its  maximum  size  of  300,000  cells.  This  implies 
that  cells  do  not  sustain  additional  mutations  in 
the  period  of  time  between  conception  and  about 
24  weeks  postconception,  when  the  pool  of 
normal  Schwann  cells  associated  with  the  vestib¬ 
ular  nerves  reaches  it  definitive  size  [Vizoso, 
1950].  This  assumption  is  not  true,  and  it  should 
be  possible  to  develop  a  model  in  which  cells  are 
at  risk  of  mutating  at  any  time  from  conception, 
through  determination  of  the  small  pool  of 
precursor  cells  that  gives  rise  to  the  Schwann  cells 
of  the  vestibular  nerve,  to  the  rapid  expansion  of 
this  pool  to  its  maximum  size,  and  afterwards  as 
these  cells  occasionally  divide  in  the  process  of 
cell  maintenance.  Moolgavkar  and  Luebeck  [1990] 
discussed  a  model  in  which  the  parameters  are 
assumed  to  vary  over  time.  Applying  models  of 
this  kind  to  our  data  would  be  of  interest,  and  will 
be  the  focus  of  further  work. 

Vestibular  schwannomas  arise  through  a  pro¬ 
cess  of  mutation  in  one  of  the  Schwann  cells 
associated  with  the  vestibular  nerves.  Our  as¬ 
sumed  value  for  the  number  of  Schwann  cells  in 
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this  normal  tissue  pool  is  300,000;  here  we  discuss 
the  sensitivity  of  the  results  to  this  assumption. 
We  refit  all  models  assuming  a  range  of  values  for 
this  parameter,  to  determine  the  effect  on  esti¬ 
mates  obtained  for  other  model  parameters  and 
fitted  distribution  functions.  We  tried  values 
between  200,000-400,000  for  all  models,  and  in 
all  cases  the  effect  on  the  fitted  distribution 
functions  was  negligible.  The  values  of  the  log- 
likelihoods  evaluated  at  the  parameter  estimates 
were  also  unchanged  by  these  perturbations  of  the 
assumed  number  of  cells  in  the  tissue.  The  actual 
values  of  the  other  parameter  estimates  did 
change  with  different  assumed  values  for  the 
number  of  cells  in  the  tissue.  For  the  Poisson 
model,  the  estimated  mutation  rates  varied  from 
8.35”8  to  1.67  x  10“6  for  400,000  to  200,000  tissue 
cells.  For  the  2-hit  model,  the  mutation  rate  varied 
between  2.82  x  10“8  to  5.65  x  10"8  for  400,000  to 
200,000  tissue  cells.  Although  the  estimates  for  the 
cell  division  and  death  parameters  also  varied 
across  fittings,  the  difference  between  these 
parameters,  i.e.,  the  net  cell  proliferation  rate, 
remained  roughly  constant.  For  the  3-hit  model, 
we  saw  mutation  rate  estimates  in  the  range  from 
8.73  xlO-5  to  6.20  x  1(T5  for  200,000  to  400,000 
tissue  cells.  Again,  the  cell  division  and  death  rate 
estimates  varied  across  fittings,  but  the  difference 
between  these  two  parameters  remained  roughly 
constant  across  the  fittings. 

In  summary,  we  do  not  feel  that  our  results  are 
sensitive  to  modest  perturbations  of  the  assumed 
number  of  Schwann  cells  in  the  normal  tissue  pool 
from  which  a  VS  arises.  The  rate-limiting  step  in 
the  development  of  VS  in  these  models  is  the 
mutation  rate.  Although  other  events,  including 
proliferation  of  the  single  mutated  tumor  cell  to 
form  a  mass  large  enough  to  be  detected  clinically, 
are  involved  in  tumor  formation,  we  assumed  that 
these  processes  do  not  vary  among  tumors.  This 
assumption  is  incorrect  but  should  not  influence 
our  conclusions  as  long  as  these  processes  are 
independent  of  the  variables  included  in  the 
model. 

The  Poisson  model  provides  a  poor  fit  to  the 
data,  but  the  2-hit  and  3-hit  models  fit  the  data 
well,  except  for  ages  less  than  18  years.  The  3-hit 
model  provides  a  marginally  better  fit  according 
to  Figure  3  and  the  Akaike  information  criterion. 
The  result  for  the  Poisson  model  is  not  surprising, 
as  it  estimates  only  a  single  parameter  from  the 
data.  Thus  the  Poisson  model  is  much  less  flexible 
than  the  other  two  models.  The  slightly  superior 
fit  of  the  3-hit  model  is  compatible  with  the 


possibility  that  the  pathogenesis  of  these  tumors 
requires  more  than  the  two  mutations  of  a 
Schwann  cell.  Other  tumors,  most  notably  colon 
cancer,  have  been  shown  to  develop  after  a 
multistep  pathogenesis.  Moolgavkar  and  Luebeck 
[1992]  used  models  similar  to  those  described  here 
to  model  the  multistep  pathogenesis  of  colon 
cancer  accurately.  Other  authors  also  discussed 
the  pathogenesis  of  colon  cancer  as  a  multistage 
process  [Chung,  2000;  Kinzler  and  Vogelstein, 
1996].  The  notion  that  more  than  two  mutations 
are  required  to  produce  a  schwannoma  in  NF2  is 
supported  by  molecular  genetic  studies  [Lamszus 
et  al.,  2000;  Bruder  et  al.,  1999].  There  is  clearly  a 
need  for  further  molecular  genetic  studies  to 
examine  the  effect  of  other  genes  on  the  develop¬ 
ment  of  NF2-associated  tumors. 

The  methodology  discussed  here  can  also  be 
applied  to  patient  data  for  other  NF2  tumors,  such 
as  meningiomas  and  epidymomas.  As  well,  it 
would  be  interesting  to  examine  the  effect  of 
incorporating  genotype  information  into  these 
models.  Genotype-phenotype  correlations  have 
been  observed  in  NF2,  and  the  inclusion  of  a 
patient's  genotype  may  result  in  a  model  that  fits 
the  data  more  closely. 
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of  NF2,  age  at  onset  of  hearing  loss,  and  number  of  intracranial  meningiomas  in 
390  NF2  nonprobands  from  153  unrelated  families.  A  significant  intrafamilial 
correlation  was  observed  for  each  of  the  three  features:  age  at  onset  (0.35;  95% 
confidence  interval  (Cl)  0.23-0.47),  age  at  onset  of  hearing  loss  (0.51;  95%  Cl, 
0.35-0.64),  and  number  of  meninginomas  (0.29;  95%  Cl,  0.15-0.43).  Significant 
correlations  were  also  observed  for  age  at  first  symptom  within  NF2  families  with 
truncating  mutations  (0.41;  95%  Cl,  0.06-0.68)  or  splice-site  mutations  (0.29; 

95%  Cl,  0.03-0.51),  for  age  at  onset  of  hearing  loss  within  families  with  missense 
mutations  (0.67;  95%  Cl,  0.18-0.89),  and  for  number  of  meningiomas  within 
families  with  splice-site  mutations  (0.39;  95%  Cl,  0.13-0.66).  Our  findings  are 
consistent  with  effects  of  both  allelic  and  nonallelic  familial  factors  on  the  clinical 
variability  of  NF2.  Genet.  Epidemiol.  23:245-259,  2002.  e  2002  Wiley-Liss,  Inc. 

Key  words:  intrafamilial  correlation;  random  effects  model;  right  censoring;  negative-binomial  gamma 
mixture  model;  neurofibromatosis  2 


INTRODUCTION 

Variable  expressivity  is  common  in  Mendelian  diseases,  especially  those  that  are 
transmitted  as  autosomal-dominant  traits.  Variable  expressivity  may  be  manifested 
in  many  different  ways,  including  variation  in  age  at  onset,  types  and  numbers  of 
clinical  features  that  develop,  overall  disease  severity,  rate  of  progression,  length  of 
course,  or  final  outcome.  Many  different  genetic  and  nongenetic  causes  of  variable 
expressivity  may  exist  and  act  alone  or  in  combination  [Scriver  and  Waters,  1999; 
Dipple  and  McCabe,  2000]. 

Random  effects  models  are  used  to  estimate  intraclass  and  intrafamilial 
associations  by  dividing  phenotypic  variance  into  components  that  are  attributable 
to  different  sources  of  variation.  Although  methods  based  on  sums  of  squares  are 
widely  used  to  estimate  these  variance  components,  this  approach  is  not  applicable 
when  censoring  is  present.  Moreover,  since  the  standard  random  effects  model  is 
based  on  normality  assumptions,  it  is  not  appropriate  when  the  data  are  discrete.  In 
this  paper,  we  extended  the  standard  random  effects  model  to  overcome  these 
limitations.  We  demonstrate  the  use  of  these  extended  models  by  analyzing  the 
familiarity  of  selected  clinical  features  of  neurofibromatosis  2  (NF2). 

NF2  is  a  highly  penetrant  Mendelian  disease  that  is  transmitted  as  an 
autosomal-dominant  trait.  The  incidence  of  NF2  at  birth  has  been  estimated  to  be 
between  1  in  33,000  and  1  in  40,000  [Evans  et  al.,  1992a].  Age  at  presentation  is 
usually  between  1 1-30  years,  although  younger  cases  and  diagnoses  in  the  fourth  and 
fifth  decades  also  occur  [Evans  et  al.,  1992a;  Parry  et  al.,  1994].  The  hallmark  of  NF2 
is  bilateral  vestibular  schwannomas  (VSs),  but  meningiomas,  nonvestibular 
schwannomas,  and  presenile  cataracts  are  also  common.  NF2  symptoms  are  usually 
related  to  “tumor  burden,”  i.e.,  the  number,  size,  and  location  of  tumors,  and  may 
include  hearing  loss,  tinnitus,  vertigo,  seizures,  facial  weakness,  and  visual 
impairment  [Evans  et  al.,  1992c;  Parry  et  al,  1994], 

The  responsible  gene,  NF2 ,  has  been  identified  and  sequenced  [Trofatter  et  al., 
1993;  Rouleau  et  al,  1993].  Pathogenic  mutations  have  been  found  throughout  the 
gene,  and  a  different  mutation  occurs  in  almost  every  family.  These  mutations  are  of 
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various  types,  but  most  can  be  classified  as  nonsense,  frameshift,  splice-site, 
missense,  or  large  deletions  [MacCollin,  1999], 

Clinical  studies  indicate  that  the  phenotypic  expression  and  natural  history  of 
NF2  tend  to  be  similar  within  a  family,  and  that  more  variability  occurs  between 
families  [Evans  et  al.,  1992a;  Parry  et  al.,  1994, 1996],  Previous  studies  demonstrated 
allele-phenotype  correlations  for  certain  NF2  mutation  classes.  In  general, 
constitutional  truncating  mutations  (frameshift  or  nonsense)  are  associated  with' 
severe  disease,  missense  mutations  and  large  deletions  with  milder  disease,  and 
splice-site  mutations  with  variable  disease  severity,  although  exceptions  do  occur 
[Kluwe  et  al.,  1996,  1998;  Parry  et  al.,  1996;  Ruttledge  et  al.,  1996;  Evans  et  al., 
1998a], 

Despite  the  general  similarity  in  disease  severity  among  affected  relatives, 
substantial  phenotypic  differences  may  occur  within  families  [Mautner  et  al.,  1996; 
Baser  et  al.,  1996b].  It  is  not  known  whether  this  variability  occurs  by  chance  or  is 
caused  by  modifying  genes  at  other  loci  [Bruder  et  al.,  1999],  coincident 
environmental  exposures,  or  some  combination  of  factors  [Baser  et  al.,  1996b]. 

We  developed  statistical  methods  to  estimate  the  magnitude  of  intrafamilial 
correlations  for  continuous  variables  with  censored  observations  and  for  count 
variables.  We  used  these  methods  to  test  whether  the  phenotypic  similarities  found 
among  relatives  with  NF2  can  be  explained  entirely  by  the  recognized  NF2  mutation 
class-phenotype  correlation.  We  calculated  intrafamilial  correlation  coefficients  ( x ) 
for  three  clinical  features  (age  at  first  symptom,  age  at  onset  of  hearing  loss,  and 
number  of  intracranial  meningiomas)  for  a  large  series  of  NF2  patients  and  within 
subgroups  of  patients  with  truncating  mutations,  splice-site  mutations,  missense 
mutations,  or  large  deletions  of  the  NF2  gene.  We  demonstrate  significant 
intrafamilial  correlations  for  each  of  these  phenotypic  features  within  the  entire 
group  of  NF2  patients  and  in  one  or  more  subgroups  of  patients  with  a  particular 
class  of  constitutional  NF2  mutations.  Our  findings  suggest  that  familial  factors 
beyond  NF2  mutation  class  are  important  in  the  pathogenesis  of  these  features  in 
some  patients  with  NF2. 

MATERIALS  AND  METHODS 
Statistical  Analysis 

Random  effects  model  for  censored  data  ' 

In  a  random  effects  model,  the  total  variance  for  a  variable  can  be  separated 
into  two  components:  variance  between  families  and  variance  within  a  family 
(erjy).  Let  k  be  the  number  of  families  in  the  study,  n,  be  the  number  of  affected 
members  in  the  ith  family,  and  Yv  be  the  value  of  the  yth  patient  of  the  rth  family. 
The  statistical  model  is 

Yv  =  n  +  Ai  +  sy,  i=  1  ,...k;j=  1  ,...««, 

where  A, ,s  are  independent  normal  random  variables  with  mean  0  and  variance  g2b; 
£y-s  are  also  independent  random  variables  with  mean  0  and  variance  a2w-  A,s  and  SyS 
are  mutually  independent.  In  the  above  model,  /t  represents  the  overall  mean  of  all 
the  individuals;  At  is  common  to  all  the  members  from  the  same  family,  representing 
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the  deviation  of  the  mean  of  this  particular  family  from  the  overall  mean  pi.  The 
variance  of  Ah  cr|,  reflects  the  between-family  variation,  and  the  variance  of  %,  a2Wt 
reflects  the  within-family  variation.  The  total  variance  a2  is  the  sum  of  <r|  and  ajy. 
When  the  feature  is  relatively  homogenous  within  families,  will  be  small  in 
comparison  to  the  total  variance.  Therefore,  the  strength  of  intrafamilial 
resemblance  can  be  measured  by  the  ratio  of  the  between-family  variance  to  the 
total  variance:  z=a2Bl(a2B  +  o]y),  i.e.,  the  intrafamilial  correlation. 

A  widely  used  procedure  for  estimating  variance  components  is  to  equate  sums 
of  squares  to  their  expected  values;  this  approach  is  not  applicable  when  the  variable 
under  consideration  is  subject  to  right  censoring.  Therefore,  we  used  maximum 
likelihood  estimation  (MLE)  to  incorporate  survival  techniques  into  a  random 
effects  model.  Each  family  in  the  study  contributes  one  term  to  the  likelihood 
function.  For  an  individual  who  has  developed  the  age-dependent  feature,  we 
calculate  the  instantaneous  likelihood  that  the  feature  occurs  at  the  observed  onset 
age;  for  an  individual  who  does  not  have  the  feature,  we  calculate  the  likelihood  that 
the  feature  occurs  beyond  the  patient’s  current  age.  For  the  ith  family,  let  1)  be  the 
subgroup  of  all  the  individuals  with  the  feature  and  C,  the  subgroup  of  all  individuals 
without  the  feature,  yy  is  the  age  at  onset  of  the  feature  if  it  is  present;  otherwise,  yy 
is  the  patient’s  age  at  last  examination.  The  contribution  of  the  family  to  the 
likelihood  is 

Pi  =fT,(yy,  j  e  T;)Pr(7y-  >yiy,  f  e  C,|  Yy  =  yy,  j  e  Tt), 

where  fTl  is  the  joint  density  of  {  Yy,  j  e  T,},  and  the  second  term  on  the  right-hand 
side  is  the  conditional  probability  of  {Yif>yif,  feC,}  given  {YIJ=yiJ,  jeTfi.  Pi  is 
parametrized  as  a  function  of  pi,  <?  and  z  [Jobson,  1996].  The  log-likelihood  Slog  (Pt) 
can  be  maximized  numerically  with  a  quasi-Newton  method  (e.g.,  Nash,  1990)  to 
obtain  the  maximum  likelihood  estimates  of  pi,  a1  and  z  together  with  an  estimated 
covariance  matrix. 

We  applied  this  method  to  data  for  two  continuous  variables  available  on  NF2 
patients:  age  at  first  symptom  and  age  at  onset  of  hearing  loss.  For  age  at  first 
symptom,  censoring  is  present  when  a  patient  is  asymptomatic  at  the  time  of 
examination  or  death;  for  age  at  onset  of  hearing  loss,  censoring  occurs  when  a 
patient  does  not  have  hearing  loss  at  the  time  of  examination  or  death. 


Random  effects  model  for  discrete  data 

A  random  effects  model  based  on  a  normal  distribution  is  not  realistic  for  a 
count  variable  with  a  high  frequency  of  zeros,  such  as  number  of  meningiomas  in  a 
patient  with  NF2.  We  considered  using  a  Poisson  distribution  to  model  these  data, 
but  the  mean  and  variance  are  equal  in  the  Poisson  distribution.  In  contrast,  the 
within-family  variation  is  greater  than  the  mean  in  the  NF2  meningioma  data.  We 
used  a  negative-binomial  gamma  mixture  model,  based  on  the  assumption  that  the 
expected  count  may  differ  between  families  as  well  as  within  a  single  family.  The 
similarity  within  families  is  represented  by  a  factor  with  a  gamma  distribution.  For 
any  given  family,  the  count  in  each  member  follows  a  negative-binomial  distribution 
[Lawless,  1987]  conditional  on  the  familial  factor. 
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Suppose  in  the  rth  family,  Yy  is  the  count  in  the  Jth  member.  We  assume  that  the 
family  factor  A/  is  an  unobserved  random  variable  having  a  gamma  distribution  with 
mean  1  and  variance  1/6.  Conditionally  on  A,-,  YyS  are  independent  and  have  a 
negative-binomial  distribution  with  mean  py  =  p0Ab  where  p0  is  the  overall  mean 
count  across  all  the  families.  Given  Hy  and  another  parameter  A,  the  probability 
function  of  the  negative-binomial  distribution  is  fully  specified  as 


Pr  (Yij  =  y)  = 


rq+y)^1 

r(A)y\(iiij  +  X)x+y 


Since  the  family  factor  At  is  a  random  variable,  the  count  per  patient  varies  from 
family  to  family.  A  large  variance  of  At  implies  that  the  families  are  very  different  in 
their  means.  The  correlation  between  two  particular  family  members,  t,  depends  on 
6,  A,  and  po : 


T  _  £ M 

fiyOX  +  /Xq(1  +  6  4-  A) 

The  mean  fly  can  also  be  allowed  to  depend  on  covariates  through  a  log  link 
function.  Let  Xy  be  a  vector  of  covariates  and  /?  the  vector  of  coefficients,  then 
Hy = AiCxp(xyP).  The  correlation  between  two  particular  family  members  is  no  longer 
a  constant,  but  instead  depends  on  their  x-values.  If  the  covariate  Values  of  two 
family  members  are  Xy  and  xy,  the  correlation  between  them  is: 

+  jU?(l  +  0  +  X)\njiQX  +  pj, ( 1  +  8  4-  A)] 


where  pk=exp(xik[l),  k=j  or /. 

Note  that  in  the  gamma  negative-binomial  model,  the  variance  cannot  be 
partitioned  into  additive  components.  The  variance  of  the  familial  factor  is  1  /6  and 
the  conditional  variance  of  the  individual  factor  depends  on  the  dispersion 
parameter  1/A.  They  are  not  additive  because  there  is  additional  variation  from 
the  Poisson  sampling  that  depends  on  the  mean. 

We  used  this  negative-binomial  gamma  mixture  model  to  assess  familiality  of 
meningioma  count  data  in  NF2  patients.  The  maximum  likelihood  estimates  of  6,  A, 
and  no,  together  with  an  estimated  covariance  matrix,  were  obtained  numerically 
using  a  quasi-Newton  method  [Nash,  1990],  and  the  standard  error  of  x  was  derived 
by  the  delta  method  [Agresti,  1990],  It  would  be  appropriate  to  include  covariates 
such  as  age,  but  this  information  was  unavailable  for  many  patients  in  our  data  set. 
Therefore,  no  covariates  were  included  in  the  analysis  presented  below. 


Genotype-phenotype  correlations 

The  constitutional  NF2  mutation  was  known  in  a  subset  of  the  families,  and  this 
permitted  us  to  assess  whether  the  NF2  allele-phenotype  correlation  accounts  for  all 
of  the  intrafamilial  correlation  observed.  Patients  belonging  to  families  with  each  of 
the  following  four  kinds  of  NF2  constitutional  mutations  were  analyzed  separately: 
1)  truncating  mutations  (frameshift  or  nonsense),  2)  splice-site  or  splice  effect 
mutations,  3)  missense  mutations,  and  4)  large  deletions. 
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TABLE  I.  Number  of  NF2  Families  and  Patients  Included  for  Each  of  the  Clinical  Features  Examined 


Mutation  type  and  clincal  feature 

Number  of  families 

Number  of  patients 

All  mutation  types 

Age  at  first  symptom 

150 

373 

Age  of  onset  of  hearing  loss 

114 

261 

Number  of  intracranial  meningiomas 

122 

259 

Truncating  (nonsense  or  frameshift) 

Age  at  first  symptom 

37 

58 

Age  of  onset  of  hearing  loss 

25 

39 

Number  of  intracranial  meningiomas 

30 

44 

Splice-site 

Age  at  first  symptom 

32 

101 

Age  at  onset  of  hearing  loss 

23 

60 

Number  of  intracranial  meningiomas 

27 

79 

Missense 

Age  at  first  symptom 

12 

50 

Age  at  onset  of  hearing  loss 

9 

38 

Number  of  intracranial  meningiomas 

10 

23 

Large  deletions 

Age  at  first  symptom 

13 

42 

Age  at  onset  of  hearing  loss 

11 

34 

Number  of  intracranial  meningiomas 

12 

36 

Intrafamilial  correlation  coefficients  were  calculated  within  subsets  of  families 
who  shared  similar  constitutional  NF2  mutation  types.  To  demonstrate  the  NF2 
genotype-phenotype  correlations,  we  also  compared  the  means  of  each  pair  of 
mutation  subgroups  simultaneously.  The  Bonferroni  method  [Seber,  1977]  was  used 
to  control  the  type  I  error  in  these  multiple  comparisons.  The  z-score  was  calculated 
for  the  difference  between  each  pair  of  means,  but  only  those  with  P- value  <  dk  were 
considered  to  be  statistically  significant,  where  a  was  chosen  as  0.05,  and  k  is  the  total 
number  of  pairs  tested  (6  in  this  instance). 

Patients 

Three  hundred  and  ninety  patients  from  153  families  were  ascertained  from 
both  published  and  unpublished  sources  (Supplemental  information  can  be  found  at 
http://www.interscience.wiley.com/jpages/0741-0395/suppmat/index.html).  All  pa¬ 
tients  included  met  the  Manchester  clinical  diagnostic  criteria  for  NF2  [Evans  et 
al.,  1992b],  had  an  identified  constitutional  NF2  mutation,  or  both.  Probands  were 
excluded  from  the  table  and  from  all  statistical  analyses  to  avoid  ascertainment  bias. 
All  other  affected  individuals  were  included  if  clinical  information  was  available  for 
at  least  1  of  the  3  manifestations  studied:  age  at  first  symptom,  age  at  onset  of 
hearing  loss,  or  number  of  intracranial  meningiomas.  These  variables  were  examined 
because  they  were  the  most  reliably  reported  features  across  the  various  data  sources 
used  for  the  study.  Meningiomas  were  identified  by  cranial  CT  or  MRI  scan.  Only 
intracranial  meningiomas  were  considered  in  this  study.  The  total  numbers  of 
families  and  patients  used  to  examine  each  clinical  feature  are  given  in  Table  I. 
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Age  at  first  symptom  of  NF2  and  age  at  onset  of  hearing  loss  are  both  subject  to 
right  censoring.  Censoring  can  occur  either  because  the  manifestation  was  not 
present  at  the  time  of  last  evaluation,  or  because  the  manifestation  was  not  present 
when  the  subject  died.  Death  accounts  for  a  small  proportion  of  censored  cases  in 
this  data  set. 


RESULTS 

Among  the  390  NF2  patients  included  in  this  study,  300  (76.9%)  had  bilateral 
VSs,  31  (7.9%)  had  a  unilateral  VS,  26  (6.7%)  had  no  VS,  and  in  33  cases  (8.5%)  the 
VS  status  was  unknown. 

Age  at  First  Symptom 

Three  hundred  and  seventy-three  patients  from  150  families  were  included  in  the 
study  of  age  at  first  symptom.  Seventy-two  (19%)  patients  were  asymptomatic  at 
time  of  last  examination  or  death,  and  were  therefore  treated  as  right-censored  cases 
in  this  analysis.  Among  the  symptomatic  patients,  age  at  first  symptom  ranged  from 
1-62  years. 

To  assess  the  assumption  of  normality  for  age  at  first  symptom,  we  examined 
normal  probability  plots  for  all  subjects  together  and  for  subjects  in  each  mutation 
subclass.  These  plots  did  not  show  extreme  skewness,  except  in  the  subclass  of 
patients  with  large  deletion  mutations,  where  the  distribution  was  skewed  to  the 
right.  The  random  effects  model  was  also  fit  in  this  subgroup,  using  log-transformed 
age  at  first  symptom.  The  estimate  of  x  was  about  the  same,  so  only  the  results  of  the 
model  using  untransformed  values  of  age  are  reported  here. 

Table  II  shows  the  means,  standard  deviations,  and  intrafamilial  correlations 
calculated  for  affected  members  of  all  families  included  in  this  study,  as  well  as  for 
members  of  families  with  each  of  four  types  of  constitutional  NF2  mutations: 
truncating  mutations,  splice-site  mutations,  missense  mutations,  and  large  deletions. 
The  value  of  x  within  each  subgroup  of  mutations  except  large  deletions  was  similar 
in  magnitude  to  that  seen  when  all  families  were  analyzed  together.  For  all  NF2 
mutations  considered  together,  the  intrafamilial  correlation  coefficient  for  age  at  first 
symptom  was  0.35,  and  the  lower  95%  confidence  limit  was  0.23.  The  95% 


TABLE  II.  Mean,  Standard  Deviation,  and  IntrafamiUal  Correlation  of  Age  at  First  Symptom  in  373  NF2 
Patients  From  150  Families® 


Mutation  type 

Censoring  rate 

Mean 

Standard 

deviation 

Intrafamilial 
correlation  (t) 

All  mutation  types 

19% 

24.9  (23.1,  26.8) 

13.1  (12.0,  14.2) 

0.35  (0.23,  0.47) 

Truncating  (nonsense 
or  frameshift) 

15% 

18.7  (15.6,  21.9) 

9.5  (7.6,  11.5) 

0.41  (0.06,  0.68) 

Splice-site 

19% 

25.1  (21.6,  28.5) 

12.1  (10.0,  14.2) 

0.29  (0.03,  0.51) 

Missense 

20% 

29.3  (24.1,  34.6) 

11.9  (8.9,  14.9) 

0.32  (0,  0.61) 

Large  deletions 

14% 

24.5  (20.0,  29.0) 

11.3  (8.7,  14.0) 

0.10  (0,  0.34) 

^Approximate  95%  confidence  intervals  of  point  estimates  are  given  in  parentheses. 
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confidence  intervals  for  r  were  always  wider  in  the  subgroups,  as  expected  with 
smaller  sample  sizes.  Nevertheless,  in  two  of  the  subgroups  (truncating  mutations 
and  splice-site  mutations),  the  lower  limit  of  the  95%  confidence  interval  of  z 
excluded  0. 

We  conducted  pairwise  tests  to  assess  differences  between  mean  ages  at  first 
symptom  in  the  subgroups,  and  tested  nominal  statistical  significance  using  the 
Bonferroni  method.  The  mean  age  at  first  symptom  in  the  subgroup  with  truncating 
mutations  was  significantly  different  from  the  mean  age  at  first  symptom  in  the 
splice-site  and  missense  subgroups,  whereas  the  differences  between  all  other  pairs 
were  not  statistically  significant.  Patients  with  truncating  mutations  had  an  earlier 
mean  age  at  first  symptom  (18.7  years)  and  less  variation  (standard  deviation,  9.5 
years).  The  pattern  of  age  at  first  symptom  is  shown  more  clearly  in  the  Kaplan- 
Meier  estimates  of  the  proportion  of  asymptomatic  patients  at  various  ages  for  each 
mutation  type  (Fig.  1). 


Age  at  Onset  of  Hearing  Loss 

Of  261  NF2  patients  from  1 14  families  for  whom  hearing  status  was  known,  192 
individuals  (74%)  had  lost  their  hearing  at  the  time  of  examination.  The  age  at  onset 
of  hearing  loss  among  these  patients  ranged  from  3-62  years.  Sixty-nine  (26%)  of  the 
261  patients  did  not  have  hearing  loss  at  the  time  of  last  examination  or  death,  and 
were  treated  as  right-censored  in  the  analysis. 


Fig.  1.  Kaplan-Meier  estimates  of  probability  of  remaining  asymptomatic  at  a  given  age.  [Color  figure 
can  be  viewed  in  the  online  issue,  which  is  available  at  www.interscience.wiley.com] 
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TABLE  HI.  Mean,  Standard  Deviation,  and  Intrafamilial  Correlation  of  Age  at  Onset  of  Hearing  Loss  for 
261  NF2  Patients  From  114  Families* 


Mutation  type 

Censoring 

rate 

Mean 

Standard 

deviation 

Intrafamilial 
correlation  (t) 

All  mutation  types 

26% 

29.6  (27.2,  32.1) 

13.3  (11.8,  14.9) 

0.51  (0.35,  0.64) 

Truncating  (nonsense 
or  frameshift) 

23% 

22.2  (19.3,  25.1) 

7.4  (5.5,  9.4) 

0.41  (0,  0.76) 

Splice-site 

40% 

31.6  (27.4,  35.9) 

11.7  (8.9,  14.5) 

0.29  (0,  0.62) 

Missense 

24% 

36.9  (27.4,  46.4) 

14.9  (8.1,21.7) 

0.67  (0.18,  0.89) 

Large  deletions 

26% 

28,9  (22.2,35.6) 

13.0  (9.2,  16.8) 

0.19  (0,  0.55) 

“Approximate  95%  confidence  intervals  of  the  point  estimates  are  given  in  parentheses. 


To  assess  the  assumption  of  normality  for  age  at  onset  of  hearing  loss,  we 
examined  normal  probability  plots  for  all  patients  together  and  for  each  mutation 
subclass.  The  distribution  for  all  cases  together  was  not  skewed,  but  right  skewness 
was  observed  for  all  subclasses  except  truncating  mutations.  A  logarithmic 
transformation  provided  a  better  fit  for  the  subgroups  that  had  a  skewed 
distribution,  but  the  estimates  of  t  remained  almost  the  same  as  without  the 
transformation.  For  this  reason,  we  only  report  results  for  the  analysis  without 
transformation  of  age. 

The  means,  standard  deviations,  and  intrafamilial  correlations  for  age  at  onset 
of  hearing  loss  are  reported  in  Table  III,  and  the  Kaplan-Meier  estimates  are  plotted 
in  Figure  2.  A  strong  intrafamilial  correlation  was  seen  for  age  at  hearing  loss  when 
all  patients  were  considered  together  (t  =  0.51;  95%  Cl,  0.35-0.64).  Within  the 
subgroups  defined  by  constitutional  NF2  mutation  type,  those  with  missense 
mutations  had  a  somewhat  higher  intrafamilial  correlation  than  the  other  subgroups, 
and  it  was  only  in  this  subgroup  that  the  95%  confidence  interval  of  the  correlation 
coefficient  excluded  zero. 

Pairwise  tests  showed  that  the  mean  age  at  onset  of  hearing  loss  for  patients 
with  truncating  mutations  was  significantly  lower  than  that  of  patients  with  splice- 
site  or  missense  mutations.  The  means  of  the  other  subgroups  did  not  differ 
significantly  from  each  other. 

Number  of  Intracranial  Meningiomas 

Two  hundred  and  fifty-nine  NF2  patients  from  122  families  were  used  in  the 
study  of  intracranial  meningiomas.  The  distribution  of  number  of  meningiomas  is 
summarized  in  Table  IV  by  mutation  type,  and  estimates  of  the  model  parameters 
and  intrafamilial  correlations  are  reported  in  Table  V. 

The  mean  number  of  intracranial  meningiomas  per  patient  was  1.01  (95%  Cl, 
0.70-1.32).  A  significant  intrafamilial  correlation  for  number  of  meningiomas  was 
observed  for  all  NF2  patients  combined  (t  =  0.29;  95%  Cl,  0.15-0.43). 

NF2  patients  with  truncating  mutations  had  the  highest  mean  number  of 
meningiomas,  i.e.,  1.92  (95%  Cl,  1.02-2.82),  but  this  was  associated  with  relatively 
high  within-family  variance.  The  magnitude  of  the  intrafamilial  correlation 
coefficient  was  small  in  this  subgroup. 
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Fig.  2.  Kaplan-Meier  estimates  of  probability  of  not  developing  hearing  loss  by  a  given  age.  [Color  figure 
can  be  viewed  in  the  online  issue,  which  is  available  at  www.interscience.wiley.com] 


TABLE  IV.  Distribution  of  Number  of  Meningiomas  in  259  NF2  Patients  From  122  Families 


Mutation  type 

Frequency  of  number  of  meningiomas 

0 

1 

2 

3 

4 

5 

6 

7 

9 

10 

19 

All  mutation  types 

164 

50 

19 

8 

6 

5 

2 

1 

2 

1 

1 

63.3% 

19.3% 

7.3% 

3.1% 

2.3% 

1.9% 

0.8% 

0.4% 

0.8% 

0.4% 

0.4% 

Truncating51 

18 

9 

4 

4 

4 

1 

1 

1 

1 

1 

0 

40.9% 

20.5% 

9.1% 

9.1% 

9.1% 

2.3% 

2.3% 

2.3% 

2.3% 

2.3% 

Splice-site 

50 

19 

6 

1 

1 

1 

0 

0 

1 

0 

0 

63.3% 

24.1% 

7.6% 

1.3% 

1.3% 

1.3% 

1.3% 

Missense 

16 

6 

0 

1 

0 

0 

0 

0 

0 

0 

0 

70.0% 

26.1% 

4.3% 

Large  deletions 

25 

4 

6 

0 

0 

0 

0 

0 

0 

0 

1 

69.4% 

11.1% 

16.7% 

2.8% 

aNonsense  and  frameshift. 

The  mean  number  of  intracranial  meningiomas  among  NF2  patients  with 
splice-site  mutations  was  0.72  (95%  Cl,  0.29-1.15).  In  contrast  to  the  situation  with 
truncating  mutations,  the  within-family  variation  was  small,  and  the  between-family 
variation  was  large  for  NF2  patients  with  splice-site  mutations.  The  point  estimate  of 
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TABLE  V.  Parameter  Estimates  and  Intrafamilial  Correlation  Coefficients  for  Number  of  Meningiomas* 


Mutation  type 

l/0b 

Iff 

Mean  Qio ) 

Intrafamilial 
correlation  (t) 

All  mutation  types 

1.24  (0,53,  1.95) 

0.93  (0.32,  1.54) 

1.01  (0.70,  1.32) 

0.29  (0.15,  0.43) 

Truncating  (nonsense 
and  frameshift) 

0.29  (0.00,  0.58) 

1.19(0.17,  2.21) 

1.92(1.02,  2.82) 

0.12  (0,00,  0.25) 

Splice-site 

1.43  (0.14,  2.72) 

0.34  (0,  0.89) 

0.72(0.29,  1.15) 

0.39(0.13,  0.66) 

Missense 

0.28  (0,  0.69) 

0.47  (0,  2.45) 

0.42  (0.09,  0.75) 

0.08  (0,  0.23) 

Large  deletions 

1.25  (0,  3.94) 

2.37  (0,  5.96) 

0.96  (0,  2.02) 

0.16  (0,  0.45) 

Approximate  95%  confidence  intervals  for  the  point  estimates  are  given  in  parentheses. 
bl /8,  variance  between  families. 

°1/A,  negative  binomial  dispersion  parameter. 


the  intrafamilial  correlation  coefficient  was  higher  in  this  subgroup  than  in  any  of  the 
other  mutation  subgroups,  and  the  95%  confidence  interval  excluded  zero. 

The  mean  number  of  intracranial  meningiomas  among  NF2  patients  with 
missense  mutations  was  0.42  (95%  Cl,  0.09-0.75),  the  lowest  among  the  four 
mutation  subgroups  because  16  of  the  23  patients  in  this  subgroup  had  no 
meningiomas.  The  variation  both  between  families  and  within  a  family  was  similar  in 
magnitude  to  the  mean,  and  the  intrafamilial  correlation  coefficient  was  small. 

The  mean  number  of  intracrainal  meningiomas  among  NF2  patients  with  large 
deletions  was  0.96  (95%  Cl,  0—2.02).  The  within-  and  between-family  variances  were 
both  large,  mainly  because  of  one  patient  (patient  201  in  family  1648)  who  developed 
19  meningiomas  by  age  18  [Bruder  et  al.,  2001],  More  than  2/3  of  individuals  with 
this  mutation  type  had  no  meningiomas,  and  all  of  the  others  had  either  one  or  two 
meningiomas,  including  patient  201’s  two  affected  relatives.  When  this  family  was 
excluded  from  the  analysis,  the  mean  number  of  meningiomas  among  the  remaining 
patients  with  large  deletions  was  0.39,  both  within-  and  between-family  variation 
were  much  smaller,  and  the  intrafamilial  correlation  was  even  lower  (0.06). 


DISCUSSION 
Statistical  Methods 

The  statistical  methods  used  here  should  be  of  use  in  intrafamilial  correlation 
studies  of  other  genetic  diseases.  Random  effects  models  are  commonly  used  to 
analyze  intraclass  and  intrafamilial  correlations  in  continuous  traits,  and  we 
extended  this  method  to  include  right-censored  data.  The  maximum  likelihood 
method  we  describe  can  also  accommodate  two  other  types  of  censoring  frequently 
associated  with  age-related  traits:  left  censoring  (e.g.,  the  event  occurred  before  time 
of  examination)  and  interval  censoring  (e.g.,  the  event  occurred  between  two 
examinations).  A  mixed-effects  model  can  be  used  to  adjust  the  correlations 
calculated  by  this  method  for  covariates  [Searle  et  al.,  1992], 

The  negative-binomial  gamma  mixture  model  we  developed  for  count  traits  is 
also  likely  to  be  useful  for  other  genetic  diseases.  A  Poisson  mixture  model 
is  sometimes  used  with  count  data  [Foulley  et  al.,  1987],  but  the  Poisson  distribution 
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is  constrained  because  the  variance  is  equal  to  the  mean.  A  mixture  model  based  on 
the  negative-binomial  distribution  allows  more  flexibility,  and  is  therefore  more 
appropriate  for  count  variables  with  overdispersion  relative  to  the  Poisson 
distribution  [Tempelman  and  Gianola,  1996], 

Intrafamilial  Correlations  in  NF2 

Phenotypic  variability  is  observed  in  individuals  with  NF2,  both  within  and 
between  families.  We  employed  a  random  effects  model  incorporating  survival 
techniques  to  estimate  intrafamilial  correlations  in  two  continuous  variables  that  are 
right-censored:  age  at  first  symptom,  and  age  at  onset  of  hearing  loss.  We  used  a 
negative-binomial  gamma  mixture  model  to  estimate  intrafamilial  correlations  for  a 
discrete  variable,  i.e.,  number  of  intracranial  meningiomas.  Our  results  demonstrate 
that  relatives  with  NF2  are  more  similar  to  each  other  than  to  unrelated  affected 
individuals  with  respect  to  each  of  these  clinical  features.  These  observations  are 
consistent  with  anecdotal  clinical  experience  [Evans  et  al.,  2000].  Parry  et  al.  [1996] 
adjusted  for  intrafamilial  correlation  in  their  genotype-phenotype  analysis,  but  the 
intrafamilial  correlation  of  NF2  phenotypes  has  not  previously  been  tested 
statistically. 

Intrafamilial  correlations,  such  as  those  observed  in  this  study,  may  have  a 
variety  of  causes.  Effects  of  the  mutant  allele,  of  other  shared  genes,  of  shared 
environmental  factors,  or  of  a  combination  of  genetic  and  environmental  factors 
may  produce  such  correlations.  Distinguishing  between  these  possibilities  requires 
analysis  of  phenotypic  correlations  among  affected  family  members  of  various 
classes,  such  as  monozygotic  twins,  sibs,  parent-child  pairs,  and  more  distant 
relatives. 

Since  all  affected  individuals  in  the  same  family  can  be  presumed  to  carry  the 
same  constitutional  alteration  of  the  NF2  locus,  the  nature  of  the  NF2  mutation  itself 
might  account  for  the  familiality  we  observed.  This  possibility  is  supported  by  the 
associations  observed  in  cross-sectional  studies  between  allele  class  and  disease 
severity  in  NF2  [Kluwe  et  al.,  1996,  1998;  Parry  et  al.,  1996;  Ruttledge  et  al.,  1996; 
Evans  et  al.,  1998a],  Our  study  includes  data  on  patients  who  are  also  included  in 
these  earlier  studies,  and  as  expected,  we  found  similar  effects. 

We  also  observed  intrafamilial  correlations  of  similar  or  greater  magnitude  for 
each  of  the  features  studied  in  subgroups  of  patients  who  all  had  the  same  type  of 
constitutional  NF2  mutation.  While  it  is  possible  that  specific  allelic  differences 
within  each  mutation  class  account  for  these  intrafamilial  correlations,  our  findings 
could  also  reflect  the  effects  of  modifying  genes.  Recent  reports  of  putative 
modifying  loci  for  NF2  are  consistent  with  this  interpretation  [Bruder  et  al.,  1999; 
Goutebroze  et  al.,  2000],  Several  genes  other  than  NF2  have  been  implicated  in 
meningioma  development,  including  loci  on  chromosomes  1,  3,  6,  7,  and  22  [Sanson 
et  al.,  1993;  Sulman  et  al.,  1998;  Comtesse  et  al.,  1999],  but  the  contribution  of  these 
loci  to  the  interfamilial  variability  observed  in  NF2  pedigrees  is  unknown. 

Our  studies  are  subject  to  several  limitations.  We  used  data  from  a  variety  of 
sources,  and  differences  in  referral  patterns,  diagnostic  acumen,  and  criteria  for 
diagnosis  probably  exist  among  the  centers.  Age  at  first  symptom  and  age  at  onset  of 
hearing  loss  were  taken  from  published  data  (including  updated  data  provided  by  the 
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authors)  and  unpublished  data.  The  definitions  of  these  ages  may  vary  from  source 
to  source.  In  many  cases,  age  at  first  symptom  and  age  at  onset  of  hearing  loss  were 
assigned  retrospectively  and  thus  may  be  subject  to  recall  errors.  All  these  factors 
could  affect  the  accuracy  of  our  results. 

Although  this  study  was  based  on  the  largest  collection  of  clinical  data  available 
on  NF2  patients,  consideration  of  separate  mutation  types  was  limited  by  small 
sample  sizes.  Consequently,  our  estimates  of  t  for  the  subgroups  are  associated  with 
wide  confidence  intervals.  Some  of  the  correlations  that  did  not  appear  to  be 
significant  in  this  study  might  be  important,  but  require  larger  samples  for 
demonstration.  A  likelihood  ratio  test  could  be  performed  to  assess  whether 
intrafamilial  correlations  vary  significantly  with  mutation  type.  This  would  be  useful 
in  a  study  that  included  more  patients,  but  a  likelihood  ratio  test  would  not  show 
significant  differences  because  of  the  small  sample  size  of  each  subgroup  in  the 
present  study. 

The  penetrance  of  NF2  and  the  prevalence  of  individual  tumor  types  generally 
increase  with  age  [Mautner  et  al.,  1993;  MacCollin  and  Mautner,  1998].  Time  from 
onset  of  symptoms  may  also  influence  the  number  of  meningiomas  in  an  NF2 
patient,  so  it  would  be  appropriate  to  model  this  time  variable  as  an  additional 
source  of  variation  that  is  independent  of  the  familial  factor.  Unfortunately,  we  did 
not  know  the  age  at  which  meningioma  status  was  determined  for  many  of  the 
patients  in  this  study,  so  we  could  not  include  age  as  a  covariate  in  our  analysis. 

Statistical  techniques  provide  powerful  means  of  studying  genetic  and 
nongenetic  aspects  of  diseases  such  as  NF2.  Methods  are  needed  to  estimate 
intrafamilial  correlations  for  other  kinds  of  non-normally  distributed  traits,  such  as 
ordered  categorical  data  (e.g.,  severity  of  disease)  and  continuous  data  that  are  not 
normally  distributed  (e.g.,  disease  progression  rate).  Each  of  these  data  types 
requires  a  different  statistical  model  to  capture  specific  distributional  features. 
Models  that  allow  a  wide  range  of  dependent  structures,  so  that  various  genetic  and 
environmental  components  of  phenotypic  variation  can  be  assessed  at  the  same  time, 
are  especially  desirable. 
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